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as "XT"). The invention describes for the first time that XT occurs in at least two isoforms ("XT-I", "XT-IT). The invention re- 
lates furthermore to the recombinant cloning and expression of human and rat XT-I and XT-II and discloses their DNA and protein 

^ sequences. The enzymes according to the invention can be used as therapeutic agents and as diagnostic markers, e.g. for the de- 
termination of enhanced proteoglycan biosynthesis, and as biochemical markers for determination of several pathological processes 

^ such as systemic sclerosis. 
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Xylosyltransferase and Isoforms thereof 



The invention relates to the isolation, purification and characterization of the enzyme 
xylosyltransferase (defined as "XT"). The invention describes for the first time that XT occurs 

5 in at least two isoforms ("XT-I", "XT-IT). The invention relates furthermore to the recombinant 
cloning and expression of human and rat XT-I and XT-II and discloses their DNA and protein 
sequences. The enzymes according to the invention can be used as therapeutic agents and as 
diagnostic markers, e.g. for the determination of enhanced proteoglycyn biosynthesis, and as 
biochemical markers for determination of several pathological processes such as systemic 

10 sclerosis. 

Background of the Invention 

Proteoglycans are polyanionic molecules widely expressed in animal cells and virtually every 
tissue. These abundant molecules are present in the extracellular matrix and on the cell surface 

15 and serve a wide range of functions. They are increasingly implicated as important regulators in 
many biological processes, such as extracellular matrix deposition, cell membrane signal 
transfer, morphogenesis, cell migration, normal and tumor cell growth and viral infection 
(Ruoslahti, 1989, J. Biol Chem. 264, 13369-1372; HerokUf aL, 1994, J Gen. Virol 75, 1211- 
1222). Proteoglycans mediate diverse cellular processes through interaction with a variety of 

20 protein ligands. In most of these bindings electrostatic interactions with the glycosaminoglycan 
chains attached to the core protein are involved (Kjellen & Lindahl, 1991, Anna. Rev. Biochern. 
60, 443-475). Thus, the biological activity of proteoglycans is intimately related to the 
glycosaminoglycan biosynthesis. 

25 The sulfated glycosaminoglycans chondroitin sulfate, heparan sulfate, heparin and dermatan 
sulfate are bound to the proteoglycan core protein by a xylose-galactose-galactose binding 
region (Kjellen & Lindahl, 1991, I.e.). UDP-D-xylose:proteoglycan core protein {3-D- 
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various species (Hoffmann et al, 1984, Connect Tissue. Res. 12, 15 1-164), and it was shown 
that the enzyme is secreted from the endoplasmatic reticulum into the extracellular space 
together with chondroitin sulfate proteoglycans (Kahnert et al., 1991, Eur. J. Clin. Chem. Clin. 
Biochem. 29, 624-625; Gotting et al., 1999, J. Invest. Dermatol 112, 919-924). However, the 
5 processes resulting in the release of XT from the endoplasmatic reticulum or the Golgi 
compartments and the role of XT in the extracellular matrix are not yet known . 

As XT is the initial step enzyme in the biosynthesis of the glycosaminoglycan linkage region 
and as it is secreted into the extracellular matrix to a great extent, XT activity was proposed to 

10 be a diagnostic marker for the determination of an enhanced proteoglycan biosynthesis and of 
tissue destruction (Weilke et al., 1997, Clin. Chem. 43, 45-51). XT activities in the synovial 
fluid were found to be significantly increased in chronic inflammatory joint diseases (Kleesiek 
et al., 1987, J. Clin. Chem. Clin. Biochem. 25, 473-481). Recent studies have shown that serum 
XT activity is a biochemical marker for the determination of fibrotic activity in systemic 

15 sclerosis (Gotting et al.,1999, I.e.; Gotting et al., 2000, Acta Derm. Venereol. 80, 60-61.). 

Up to now there was no success to isolate or characterize xylosyltransferase (XT), however 
some methods were described to measure and determine the activity of said enzyme from blood 
or body fluid samples of patients showing pathological effects such as scleroderma or chronic 

20 joint diseases fStoolmiller, 1972, J. Biol. Chem. 247, 3525-3532). The samples were incubated 
with UDP-[ l4 C]xylose and an appropiate acceptor. The incorporated radioactivity indicated the 
amount of XT activity. Acceptors used so far are proteoglycans, silk fibroin (Campbell et al., 
1984, Anal. Biochem. 137, 505-516) and several peptides (Bourdon et al., 1987, Proc. Natl. 
Acad. Sci. USA 84, 3194-3198). However, the all activity tests used herein did not allow precise 

25 determination of the lower XT activity in serum. A more specific acceptor protein is 

recombinant bikunin, the inhibitory component of human inter-a-trypsin inhibitor. Bikunin 
carries a single chondroitin, which is essential for the structure of the inhibitor. The chondroitin 
sulfate attachement site in the N-terminal region contains all elements responsible for 
recognition by XT. The complete recognition sequence is composed of the amino acids a-a-a-a- 

30 G-S-G-a-b-a, with a = E or D and b = G, E or D. This sequence was confirmed by 

determination of the Michaelis-Menten (K m ) constants for in vitro xylosylation of different 
synthetic proteins and peptides using an enriched XT preparation from conditioned cell culture 
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supernatant of human chondrocytes. The constant was determined to be 22 jaM, which was 
decreased 9-fold in comparison to deglycosylated core protein from bovine cartilage (188 \xM) 
(Brinkmann et al., 1997, J. Biol Chem. 272, 11171-11175). With recombinant bikunin as 
acceptor, a sensitive assay was developed that allows precise determination of XT activity in 
5 human serum and other body fluids (see: Weilke et al., 1997, lc). Using this assay an increased 
xylosyltransferase activity was determined in blood of patients with sclerotic processes of 
scleroderma, closely related to an elevated proteoglycan biosynthesis (Gotting et al., 1999, I.e.;. 

The biosynthesis of glycosaminoglycans requires the coordinated action of a large number of 
10 glycosyltransferases. Isolation and cloning of these glycosyltransferases has been targeted for a 
long time, since the majority of these enzymes are only present in minute amounts. The 
structure and sequence of the glycosyltransferases involved in biosynthesis of the common 
glycosaminoglycan-protein linkage region has long remained unknown. Recent cDNA cloning 
of galactosyltransferase I (Okajima et al., 1999, J. Biol Chem. 274, 22915-22918) and 
15 glucuruonyltransferase I (Kitagawa et al., 1998, J. Biol Chem. 273, 6615-6618) identified 2 of 
the at least 4 enzymes involved in synthesis of the GlcApi-3Gaipi-3Gaipi-4Xyipi-0-Ser 
structure. 

Isolation, purification and characterization of XT involved in biosynthesis of the common 
20 carbohydrate-protein linkage structure has been hampered by difficulties in obtaining a 
sufficient amount of the source material. Since, as above mentioned, XT can be used as an 
additional biochemical marker for the determination of sclerotic activity in systemic sclerosis 
and some inflammatory disorders, it is a real need for an isolated, highly purified XT which can 
be produced by recombinant methods, which is necessary for diagnostic and therapeutic 
25 purposes. Moreover, the knowledge of the cDNA sequence of XT allows to use it on gene level 
such as in gene diagnostic or gene therapy. 

Definitions 

• Above and below the term "XT" means xylosyltransferase (UDP-D-xylose:proteoglycan 
30 core protein p-D-xylosyltransferase (EC 2.4.2.26) deriving from any origin and includes all 
possible isoforms, if not otherwise pointed out. The term "hXT" has the meaning of human 
XT; the term "rXT" of rat XT. The terms "XT-I" , "XT-IT, "hXT-T, "hXT-IT, H rXT-I2, 
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"rXT-II" mean the specific isoforms 1 and 2 of XT according to the invention, wherein h 
and r have the indicated meanings. 

• Above and below the term "protein" means a protein, a protein fragment or a peptide, if not 
otherwise explained. 

5 • Above and below the term "recombinant protein" is defined as a protein which was 
produced by recombinant and biotechnological methods. 

• Above and below the term "XT" or "XT protein(s)" has the meaning of (a) protein(s) 
deriving from any source, if not otherwise stated out, having the biological activity and / or 
function of UDP-D-xylose proteoglycan core protein fi-D-xylosyltransferase. 

10 • Above and below the term "a protein having the biological activity of UDP-D- 

xylose:proteoglycan core protein 6-D-xylosyltransferase " is defined as a protein which has 
XT - identical or XT - like functions and activities and comprises XT itself and possible 
mutations, variants, isoforms thereof, including insertions, deletions and substitutions of one 
ore more amino acids. The term includes also fragments or longer forms of XT as well as 

15 dimeric or multimeric forms thereof showing XT functions. 

• Above and below the term "isoform" means a second naturally occurring enzyme having the 
same biological activity as a first naturally occurring enzyme, however differing by another 
amino acid sequence. 

20 Summary of the Invention 

It was found that XT can be purified 4,700-fold with 1% yield from serum-free JAR 
choriocarcinoma cell culture supernatant. The isolation procedure includes a combination of 
ammonium sulfate precipitation, heparin affinity chromatography, ion exchange 
chromatography and protamine affinity chromatography. Amongst other proteins an unknown 

25 protein was detected by matrix-assisted laser desorption ionization mass spectrometry-time of 
flight analysis (MALDI-TOF) in the purified sample. The molecular weight of this isolated and 
purified XT protein was determined as 120.000 by SDS-polyacrylamide gel electrophoresis. 
The isolated protein was enzymatically cleaved by trypsin and endoproteinase Lys-C. Peptide 
fragments were sequenced by Edman degradation. Searches with the amino acid sequences in 

30 protein and EST databases showed no homology to known sequences. XT was enriched by 
immunoaffinity chromatography with an immobilized antibody against a synthetic peptide 
deduced from the sequenced peptide fragments and was specifically eluted with the antigen. In 
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addition, XT was purified alternatively with an aprotinin affinity chromatography and was 
detected by western blot analysis in the enzyme-containing fraction. 

Based on the partial amino acid sequence of their isolated and purified new enzyme (XT) 

5 derived from human JAR choriocarcinoma cell culture supernatant a novel cDNA was isolated 
according to the invention, encoding human XT-I enzyme using the degenerate reverse 
transcriptase-polymerase chain reaction method. The enzyme belongs to a novel family of 
glycosyltransferases having no homology to proteins of prior art 5'- and 3'-RACE were 
performed to isolate a novel cDNA fragment of 3726 bp with a single open reading frame 

10 encoding at least 827 amino acids with a molecular mass of 91 .000. The human XT-I gene was 
located on chromosome 16pl3.1 using radiation hybrid mapping, and extracts from CHO-K1 
cells transfected with the XT-I cDNA in an expression vector exhibited marked XT activity. 
Furthermore, a new 3608-bp cDNA fragment encoding a novel protein of 865 amino acids was 
also isolated by PCR using degenerate primers based on the amino acid sequence of human XT- 

15 L The amino acid sequence of this XT-II isoform displays 55% identity to the human XT-I. The 
XT-II gene is located on chromosome 17q21.3-17q22, and the exon/intron structure of the 15 
kb gene was determined. RT-PCR analyses of XT-I and XT-II mRNA from various tissues 
confirmed that both XT-I and XT-II transcripts are ubiquitously expressed in the human tissues, 
although with different levels of transcription. Furthermore, the cDNAs encoding XT-I and XT- 

20 II from rat were cloned. The deduced amino acid sequences of rat xylosyltransferases displayed 
94% identity to the corresponding human enzyme. 

Therefore, it is an object of this invention to provide the following subject-matters: 

• An isoform of UDP-D-xylose:proteoglycan core protein Ji-D-xylosyltransferaseQCT); 

25 • a protein comprising a sequence of said isoform or a fragment thereof, having the biological 
activity of XT; 

• a corresponding protein deriving from human or rat sources (hXT, rXT); 

• a corresponding protein isolated from specific human tissue, wherein said hXT has a 
molecular weight of 120.000 under SDS PAGE conditions; 

30 • a corresponding recombinant protein, wherein said protein is hXT-I comprising at least 827 
amino acids and having the amino acid sequence as depicted in Fig. 7B; 
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• a corresponding isoform of hXT-I, termed as hXT-II, comprising 865 amino acids, 
exhibiting approximately 55% overall sequence identity to human hXT-I, and having, in 
more detail, the amino acid sequence as depicted in Fig. 8B. 

• a process for isolating and purifying a protein having the biological activity of human UDP- 
5 D-xylose:proteoglycan core protein B-D-xylosyltransferase (EC 2.4.2.26), said process is 

characterized by the following steps: 

(i) culturing from human tissue showing an enhanced XT activity, preferably 
from JAR choriocarcinoma cells (ATCC HTB-144), and harvesting the cell 
culture supernatant, 

10 (ii) fractionated ammonium sulfate precipitation of the supernatant of step (i), 

(iii) heparin affinity chromatography of the precipitate of step (ii), 

(iv) ion exchange chromatography of the step (iii) product, and 

(v) affinity chromatography of the step (iv) product, and optionally 
(vi a SDS-Polyacrylamide gel elektrophoresis of step (v); 

15 • recombinant forms of hXT and rXT, the corresponding DNA sequences (Fig. 7A, 8A, 9, 10) 
included as well as suitable expression vectors and expression host cell systems; 

• antibodies directed against any of the above or below mentioned XT proteins and their uses 
in immunological assays and diagnostic tools for determining said XT proteins; 

• pharmaceutical compositions comprising a XT protein as defined above and below, 
20 optionally together with a suitable pharmacologically acceptable carrier, diluent or 

excipient; 

• uses of said XT proteins for the manufacture of a medicament for the treatment of XT 
relevant diseases and disorders, wherein the xylosyltransferase enzymes according to the 
invention can either be used directly as therapeutic drug in pathological situations where a 

25 deficiency of said enzyme and its isoforms can be detected; 

• uses of said XT enzymes for the manufacture of a medicament for the treatment of diseases 
and disorders which are caused or accompanied by increased levels of said enzymes, e.g. in 
sclerotic diseases and chronic joint diseases, wherein said medicament is an inhibitor or 
antagonist of said XT proteins; 

30 • uses of said XT proteins as diagnostic markers for the diagnosis of above- and below- 
mentioned diseases and pathological symptoms and uses of said DNA molecules as gene 
markers; 
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• methods of screening for compounds which are capable to inhibit the activity of said XT 
proteins according to the invention using known methods for determining the activity of 
XT. 

5 Detailed Description: 

General 

UDP-D~xylose: proteoglycan core protein fi-D-xylosyltrartsferase can be isolated and purified 
from JAR choriocarcinoma cell culture. The isolated protein according to the invention is a 
single stranded polypeptide with a molecular weight of 120.000. The protein was enzymatically 

10 cleaved and eleven peptide fragments were sequenced by Edman degradation (see Examples). 
XT is present only in very small amounts in animal tissues but unlike other 
glycosyltransferases, more than 90% of XT is enriched in the medium of cultured cells. The 
highest secretion of XT activity was measured in JAR choriocarcinoma cell culture, in which 
sternal cartilage chondrocytes and 21 different human cell lines were examined. To produce a 

15 highly enriched XT solution for the isolation of XT, JAR choriocarcinoma cells were adapted to 
hollow fiber culture conditions using a novel bioreactor (TECNOMOUSE) and Ultradoma-PF 
medium without serum addition as nutrient. For purification of XT a combination of classic 
separation methods and new affinity matrices was employed. Therefore, a heparin matrix was 
used as an affinity ligand for the XT. When applied to immobilized heparin, XT was 

20 completely adsorbed at the matrix and the XT activity was eluted only with a high salt 

concentration after most contaminating proteins have been removed from the matrix. Protamine 
chloride is well-known as cationic activator for several sulfotransferases, so the effect of 
protamine chloride on the XT was investigated. An increased XT activity was measured when 
protamine chloride was added to the XT assay solution (see Examples), indicating an 

25 interaction of these arginine rich proteins with XT. Therefore, a protamine chloride affinity 
matrix was synthesized using an aldehyde activated perfusion medium as support. The 
interaction of XT with this affinity matrix resulted in a 13.6 times enrichment of XT. The 
protamine chloride affinity chromatography was the most efficient purification step during the 
isolation of the XT. Immobilized aprotinin, a Kunitz-type proteinase inhibitor, was found to be 

30 another appropriate affinity matrix for enrichment of XT, as it was able to adsorb XT 
quantitatively. Therefore, it was used for alternative purification of XT. Different lines of 
evidence showed that the isolated protein corresponds to the XT: (a) XT activity was enriched 
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using immobilized antibodies raised against a synthetic peptide deduced from the 120.000 
protein, and the XT activity could be competitively eluted with this peptide, (b) Immunoblot 
analysis of aprotinin affinity purified XT corresponds with the 120.000 protein. However, non- 
reducing and non-denaturating gel filtration chromatography with heparin affinity-purified XT 
5 from JAR cell culture supernatant shows an additional peak of XT activity at a molecular 
weight of approximately 500.000. XT is associated with proteoglycans. Treatment of XT with 
N-glycosidase F resulted in a decrease of the molecular weight from 120.000 to 1 16.000, 
suggesting that the XT is a glycoprotein. A comparison of the molecular mass of XT with other 
glycosyltransferases involved in biosynthesis of proteoglycans shows that the XT is larger than 

10 the other enzymes. Another difference is that nearly all proteoglycan glycosyltransferases are 
tightly bound to the membrane of the endoplasmic reticulum, whereas XT is secreted into the 
extracellular space. XT according to the invention contains like many glycosyltransferases a 
DxD motif, suggesting that this motif is involved in binding the metal-ion cofactor and the 
donator substrate (Gastinel et al., 1999, EMBOJ. 18, 3546-3557). A DxD sequence was also 

15 found in peptide 8 obtained from the enzymatically cleaved XT. 

The invention discloses for the first time the molecular cloning and expression of human and 
rat XT. Surprisingly, XT is found in at least two isomeric forms, which are termed according to 
the invention as XT-I and XT-II. Based on the amino acid sequence a novel cDNA was cloned, 

20 which encodes a protein of at least 827 amino acids with a molecular mass of 91.000. A DxD 
motif was identified in the XT-I amino acid sequence using hydrophobic cluster analysis. This 
motif has been observed in many glycosyltransferases, and is involved in the coordination of a 
divalent cation in the binding of the nucleotide-sugar (Breton & Luberty, 1999, Curr. Opin. 
Struct Biol 9, 563-571.). The translation initiation codon of the XT-I cDNA could not be yet 

25 cloned probably due to strong secondary structures in the 5 '-region of the XT-I mRNA. The 
XT-I protein isolated from human JAR choriocarcinoma cell culture supernatant migrated on 
SDS-PAGE with a molecular mass of 120.000 and the protein size could be further reduced 
after N-glycanase digestion. These findings indicate that the cloned cDNA represents at least 
90% of the coding region of human XT-I. 

30 

Another cDNA was completely identified from human placenta RNA, which was similar but 
not identical to the hXT-I cDNA. The new cDNA encodes a protein of 865 amino acids with a 
molecular weight of 97.000. This novel protein termed XT-I has a proline-rich region located 
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near the amino-terminus and the type II transmembrane topology characteristic of many other 
glycosyltransferases cloned to date. The hXT-II protein was similar to the human XT-I with an 
overall sequence identity of 55%. The similarity of the predicted amino acid sequences was low 
(<10%) at the amino-terminal end. However, large stretches at the C-terminal region, where the 
5 catalytic domain was found to be located in glycosyltransferases, are very conserved with an 
identity of more than 80% in both proteins. These findings let conclude that the XT-EL gene 
encodes another human xylosyltransferase, although the catalytic activity and the biological 
role of XT-II remain to be elucidated in detail. 

10 Since alterations in XT activity have been reported to be associated with fibrotic and sclerotic 
alterations of connective tissue (Gotting et al, 1999; Gotting et al, 2000, I.e.), the present 
findings provide molecular tools to study the function and the regulated expression of human 
XT as well as the molecular mechanisms of these diseases. 

15 Production of XT 

JAR choriocarcinoma cells which had been adapted to growth in the serum-free Ultradoma-PF 
medium secreted XT activity into the cell culture supernatant. During the exponential growth 
XT activity and total protein concentration in the supernatant of a traditional cell culture system 
(T-flasks) were determined as 0.2 mUA and 0.1 g/1, respectively. The cells were cultivated 

20 using three "Tecnomouse"® bioreactor systems. Each bioreactor contained five culture 

casettes. About 10 7 cells per culture casette were inoculated and medium probes of about 0.5 ml 
were taken every day to determine the viability of the cells and the glucose, and lactate 
concentration as well as the XT activity of the cell culture supernatant. The cells amounted in 
the probes varies from 10 5 to 10 7 cells per ml with viability between 40 and 70%. XT 

25 production increased within three weeks after cell inoculation, reaching a plateau of 

approximately 30 mU/1. After three months the culture cassette was removed. Harvesting was 
carried out every two days, collecting 10 ml cell culture supernatant per culture cassette. The 
cell-free supernatant was collected and stored at -75°C. Mean XT activity in the supernatant of 
the high density culture was determined as 29.0 mU/1, while the total protein concentration was 

30 estimated as 4.8 g/1. In total 18.5 1 high density cell culture supernatant was collected and 
yielded 535.8 mU XT. 
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Isolation and purification of XT from cell culture supernatant 

XT was purified form 18.5 1 supernatant (equivalent to 2.000 1 normal cell culture supernatant) 
of serum-free cultivated JAR choriocarcinoma cells to an apparent homogeneity of about 4700- 
fold purification. A summary of the purification steps for isolation of XT is shown in Table 1. 
5 XT activity of the crude supernatant was approximately 0.006 mU/mg protein. The purification 
method according to the invention comprises four, preferably five different steps. It is possible, 
that one step can be achieved more than once if necessary. 

Step 1: Fractionated ammonium sulfate precipitation XT of the ammonium sulfate precipitable 
fraction was dissolved in 0.46 1 buffer A with solubilization of 79.5% of the original activity. 

10 Step 2: Heparin affinity chromatography on POROS 20 HE 4 ml of XT-enriched solution from 
step 1 was loaded onto the POROS 20 HE column. XT activity was completely retained on the 
column. More than 70% of total protein passed through the column. Contaminating protein was 
eluted at a low NaCl concentration. 44% of the XT activity bound to the heparin matrix 
emerged at 0.5 M NaCl (Fig. 1 A). 

15 Step 3: Ion exchange chromatography on POROS 20 HQ 4 ml of the desalted XT-containing 
fraction from step 2 was loaded onto the POROS 20 HQ column equilibrated in buffer A. More 
than 98% of the XT activity bound to the resin. The column was then eluted stepwise with NaCl 
in buffer A (Fig. IB). XT-containing fractions were collected. 

Step 4: Affinity chromatography on protamine chloride The product of step 3 was desalted and 
20 concentrated using dia- and ultrafiltration. 1 00 \x\ of the protein solution was applied to the 
POROS protamine chloride column previously equilibrated with buffer A. Approximately 95% 
of the transferase activity bound to the column, whereas 75% of the contaminating protein did 
not. Additional proteins were eluted with buffer A containing low NaCl concentrations. 
Enzyme activity was eluted at approximately 0.15 MNaCl (Fig. 1C). The enzyme activity 
25 was stable for at least 6 months at -75°C. 

Step 5: SDS-Polyacrylamide gel electrophoresis (SDS-PAGE) XT-containing fractions from 
step 1-4 were subjected to SDS-PAGE on a 4-12% gradient polyacrylamide gel (Fig. 2, panel 
A). Coomassie-stained protein bands were excised and characterized by MALDI-TOF mass 
spectrometry after tryptic digestion. The molecular weight of an unknown protein was 
30 determined as 120.000 (Fig. 2, panel B). 
Step 5 is an optionally step. 
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Table 1: 

Summary of single purification steps employed for isolation of XT from 18.5 1 of high density JAR 



cell culture supernatant. 



Step 


Volume 


Total activity Total protein 


Spec, activity Purification 


Recovery 




ml 


10-3 x units 


mg 


10-3 x units/mg 


-fold 


% 


JAR high density cell culture 












Supernatant 


18,500 


535.8 


89,355.0 


0.006 


1 


100 


Ammonium sulfate 














precipitation 


460 


426.0 


8,937.8 


0.048 


8 


79 


Heparin affinity 














chromatography 


50 


108.3 


473.0 


0.229 


40 


35 


Ion exchange 














Chromatography 


5 


91.0 


43.1 


2.090 


348 


17 


Protamine affinity 














chromatography 


1 


6.83 


0.24 


28.458 


4,743 


1 



5 Amino acid sequence analysis of XT. The MW 120.000 protein from the excised band was 
digested with trypsin and endoproteinase Lys-C. The proteolytic fragments were separated by 
re versed-phase HPLC, and selected peptides were subjected to automatic amino acid 
sequence analysis: Table II shows the obtained 1 1 amino acid sequences determined by Edman 
degradation and mass spectrometry. 

10 Table II . X represent a not identified residue. The masses of three peptides were observed in the 
MALDI mass spectrum of the enigmatically digested MW 120.000 protein. Calculated mass values 
were obtained from the sequence obtained. 



Peptide 


observed mass 
(M + H 4 ) 


calculated mass 
(M+H*) 


(l)ELG AK 






(2)ELLK 






(3)DMNFLK 






(4)IASPPSDFGR 


1045.5 


1045.5 


(5) L L L D 






(6)DFENVDNSNFAPR 


1524.7 


1525.7 


(7) P T F F A R 






(8)LQFSE VGTDXD A 






(9)ELGA VKPDGRL 


1 152.6 


1153.7 


(10) ELLKRKLEQQEK 






(ll)LGLLMPEK 
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Immunochemical detection of XT 

Polyclonal antibodies against the synthetic peptide CSRQKELLKRKLEQQEK deduced from 
the peptides 2 and 10 of the enzymatically cleaved MW 120.000 protein were covalently bound 
on POROS 20 PA. About 50% of the XT activity of an applied sample was bound (Fig. 3, panel 

5 A) when a partially purified XT sample obtained by heparin affinity chromatography 

(purification step 2) was loaded onto the column. 58% of the adsorbed XT activity was eluted 
with 1 50 mM NaCl, and the rest was eluted with 12 mM HC1. Furthermore, the adsorbed XT 
activity was also eluted from the solid phase when 100 mg/ml) of the synthetic peptide 
was added to the mobile phase (Fig. 3, panel C). When immobilized preimmune serum was 

to used as affinity matrix (negative control) no XT activity was adsorbed to or eluted from 
the matrix (Fig. 3, panel B). The desalted XT fraction after heparin affinity chromatography 
(purification step 2) was loaded on an aprotinin affinity column. The elution profile shows four 
major protein peaks (Fig. 4, panel I). 61% of the XT activity adsorbed to the aprotinin matrix 
emerged at 0.30 M NaCl and another 21% at 0.54 M NaCl. A single MW 120.000 band of the 

15 XT-containing fractions was detected by western blot analysis with the polyclonal antibodies 
(Fig. 4, panel B). 

Determination of the molecular weight of XT 

100 ^1 of heparin affinity purified XT was separated under non-reducing and non- 
20 denaturating conditions using a TSK G3000 SW column. Two XT activity maxima were 
detected at MW 500.000 and 120.000 (Fig. 5, panel A). The molecular mass of the MW 
120.000 protein was reduced about 3 % after N-glycosidase F digestion as shown by SDS- 
PAGE (Fig. 5, panel B). 

25 PCR-based cloning of human XT-I cDNA 

Based upon the amino acid sequence of 4 peptides degenerate primers were designed for 
cloning the XT-I cDNA (Figure 6). When primers SPPS1 and Lysc-inv were used in a PCR 
with the first strand cDNA of SW1353 chondrosarcoma poly(A) + mRNA as template, a major 
band of approximately 690 bp was observed. After subcloning and sequencing a previously 

30 unknown DNA sequence was obtained from clone pCGl 14-29. PCR amplification with the 
primers DF1 and a mixture of Inv2b and Inv2c resulted in a 1724 bp fragment, which was 
subcloned and sequenced. The deduced amino acid sequence of the clone pCGl 1 1-4 was 
identical to 6 of the sequenced peptides from human XT-I. The cloning strategy of rapid 
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amplification of cDNA ends (RACE) (Chenchik et al, 1996, BioTechniques 21, 526-534) was 
employed to clone the complete coding sequence of the XT-I cDNA. The largest DNA 
fragment obtained from the 3'-RACE reaction was 1.6 kb and consisted of a 3* -untranslated 
region of 1240 bp. The 5' RACE reaction was performed with different primers derived from 

5 the nucleotide sequence of XT-I cDNA, but the 5 '-untranslated region of human XT-I cDNA 
could not be cloned using this method. All DNA fragments obtained contained just the known 
cDNA sequence and an additional 80 bp of coding sequence. Thus, a PCR-based screening 
approach using cDNA libraries as template was employed for cloning of the 5' -untranslated 
region. All DNA fragments obtained from the screening of 3 different human cDNA libraries 

10 stopped at the same nucleotide, indicating that stable secondary structures of the XT-I mRNA 
prevent the synthesis of cDNA of the 5' untranslated region during the reverse transcription 
reaction. However, the translation initiation codon of human XT-I could not be yet cloned. The 
combined cDNA of human XT-I contained 3726 bp (Fig. 7 A) with a single open reading frame 
encoding at least 827 amino acids with a molecular mass of a least 91 .000. The deduced amino 

15 acid sequence contained 3 potential N-glycosylation sites (Figure 7B). Analysis of the amino 
acid sequence using the hydrophobic cluster analysis (Gaboriaud et al 9 1987, ) revealed the 
presence of a common DxD motif at position 1 82, which has been shown to be essential for 
binding nucleotide-sugars in glycosyltransferases . 

20 PCR-based cloning of human XT-H isoform cDNA 

The degenerate primers PFF-sense and Lysc-invl which were designed upon the amino acid 
sequence of proteolytic cleaved peptides of human XT-I were used in a PCR reaction with first 
strand cDNA of placenta poly(A) + mRNA as template. The PCR amplification resulted in a 
minor band of 1.1 kb, which was cloned and sequenced. The determined nucleotide sequence 

25 was similar but not identical to the XT-I cDNA sequence, indicating that the fragment encodes 
a XT-II isoform. To clone the complete coding sequence of the novel cDNA, the RACE 
strategy was employed. The cDNA finally obtained was 3608 bp (Figure 8 A) and contained a 
single open reading frame encoding a protein of 865 amino acids with a molecular mass of 
97.000. The 3' -untranslated region is 850 bp and a 5 '-untranslated region of 149 bp was 

30 identified with an in-frame stop codon upstream of the ATG codon. A Kyte-Doolittle 

hydropathy analysis (Kyte & Doolittle, 1982, J. Mol Biol 157, 105-132) of the deduced amino 
acid sequence revealed one potential membrane-spanning region consisting of 16 hydrophobic 
amino acid residues at position 16 to 32, which appears to result in a type II transmembrane 



WO 01/49831 



PCT/EP00/13311 



- 14 - 

orientation characteristic of many of the glycosyltransferases. The predicted amino acid 
sequence contains a proline-rich profile pattern from position 110 to 118 and 3 potential N- 
glycosylation sites (Figure 8B). The human XT-II isoform protein exhibits 55% overall 
sequence identity to the human XT-I including stretches in the XT-II protein with more than 
5 80% homology to the human 
XT-I. 

Cloning of rat XT-I and XT-II cDNA 

For amplification of the XT-I and XT-II cDNA from rat tissue gene-specific primers were 
10 designed based upon the nucleotide sequence of human XT-I and XT-II cDNA. Theses 

oligonucleotides were employed in PCR, 5' RACE and 3' RACE reactions with rat liver cDNA 
as template and a 2593 bp cDNA fragment (Fig. 9) coding for rat XT-I was obtained. 5' RACE 
reactions using a diversity of gene-specific primers were performed for identification of the 5 s 
end of the XT-I cDNA. All cDNA fragments obtained stopped at exactly the same nucleotide 
15 position as observed in the human XT-I cDNA, indicating that stable secondary structures of 
the XT-I mRNA inhibit cDNA synthesis of the 5' region. The XT-II cDNA fragment finally 
obtained after PCR amplification with rat liver cDNA was 2782 bp (Fig. 10) and included the 
entire coding region of rat XT-II. The deduced amino acid sequences of XT-I and XT-II (SEQ 
IDs 6, 8) from rat each displayed 94% identity to the corresponding human xylosyltransferase. 
20 Alignment of all amino acid sequences revealed the presence of highly conserved amino acid 
clusters in the central and carboxyterminal regions of the proteins. 

Identification of the chromosomal localization of the human XT-I and XT-II genes 
The gene coding for human XT-I could be assigned to the short arm of chromosome 16 at the 
25 region 1 6p 1 3 . 1 . The radiation hybrid mapping using the Genebridge 4 radiation hybrid 
screening panel located the position of the PCR fragment 5.98 cR distal to the STS marker 
CHLC.GATA42E1 1 and 4.40 cR proximal to marker D16S499. The XT-II gene was identified 
on chromosome 17 at the position 17q21.3-17q22 and was found to be located distal to the STS 
marker WI-1 1424 and proximal to the marker WI-143 15. 

30 

Expression of recombinant XT 

To prove the function of the cDNA product, a recombinant form of XT-I was generated by 
fusing the cloned XT-I to an aminoterminal peptide tag. The fused protein was expressed in 
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CHO-K1 cells and absorbed from the medium by immunoprecipitation with anii-Xpress- 
antibodies and protein G agarose beads to eliminate endogenous XT activity. The enzyme- 
bound beads were used as an enzyme source and assayed for XT activity as shown in Table 3 
(see below). No detectable XT activity was recovered by the affinity purification from a control 

5 transfection sample. The substrate specificity of the recombinant XT-I was similar to that of the 
XT-I isolated from human body fluids and cell culture supernatants. The recovered enzyme 
activity of the recombinantly expressed XT-I could be completely inhibited by addition of 250 
U of heparin. As specific inhibition of human XT activity by heparin has been demonstrated 
previously (Kleesiek et aL, 1987, La), these results clearly indicate that the expressed protein is 

10 the human XT. However, no enzymatic transfer of xylose to the acceptor peptides used in this 
study was observed when expressing XT-II fused to the aminoterminal peptide tag in CHO-K1 
cells. 

To identify the XT-I reaction products, the bikunin peptide was labeled with [ I4 C]-D-xylose 
15 using the XT-I-bound beads as an enzyme source. The products were subsequently subjected to 
the linkage-specific digestion of the bound [ 14 C]-D-xylose with a- and p-xylosidase and 
alkaline p -elimination. Incubation of the reaction products with p-xylosidase resulted in the 
release of 74% of the incorporated [ l4 C]xyiose, whereas only less than 4% of the peptide-bound 
xylose was digested after treatment with a-xylosidase. The alkaline cleavage of the 0- 
20 glycosidic linkage between the xylose and the p-hydroxyamino acid serine in the presence of 
borohydride lead to the liberation of more than 97% of the enzymatically transferred 
[ 14 C]xylose. In all the experiments performed no significant amount of [ 14 C]-D-xylose was 
incorporated without the addition of the bikunin peptide as acceptor. These results clearly 
indicate that a xylose residue was transferred to the hydroxy amino acid serine of the bikunin 
25 peptide through a p-linkage. In conclusion the expressed protein was identified as UDP-D- 
xyloserproteoglycan core protein p-D-xylosyltransferase (EC 2.4.2.26). 
Table 3: 

Xylosyltransferase activity of recombinant XT-I expressed in CHO-K1 cells. XT activity of the 
enzyme fractions using different acceptors and the inhibition of enzyme activity by addition of 
30 heparin is shown. The synthetic bikunin, L-APP and L-APLP2 homologous peptides have 

been previously proved to be good acceptors for XT mediated xyiosylation (Brinkmann et aL, 
1997; Gotting et aL, 1998). No XT activity was detected in samples from mock-transfected 
cells after affinity purification, n.d., not detected (detection limit, 20 nU/1). 
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Acceptor 



XTActivitiy 



pCG227-XT 



+ heparin 



Recombinant bikunin 

Bikunin peptide QEEGSGGGGQK 

L-APLP2 peptide SENEGSGMAEQK 

L-APP peptide TENEGSGLTNIK 

Peptide SGG 

Chondroitin sulfate A 

Chondroitin sulfate C 



2854 
463 
515 
492 
n.d. 
n.d. 
n.d. 



n.d. 
n.d. 
n.d. 
n.d. 
ad. 
n.d. 
n.d. 



Tissue-specific expression ofXT-I and the XT-II isoform 

The expression of XT-I and XT-II isoform gene was examined in various human tissues using a 
RT-PCR based method with normalized first-strand cDNA. Each PCR yielded a single product 

5 with predicted nucleotide lengths of 490 bp for XT-I and 7 1 7 bp for XT-II, although the amount 
of the amplified product varied (Figure 1 1). Amplification of XT-I and XT-II fragments 
resulted in a product visible by ethidium bromide staining after 36 cycles, whereas the DNA 
fragment corresponding to the abundant glyceraldehyde-3-phosphate dehydrogenase transcript 
was visible after 20 cycles. The greatest abundance of XT-I expression was detected in the 

10 placenta, kidney and pancreas and only a very weak expression was detected in skeletal muscle. 
The greatest abundance of XT-II isoform is expressed in the kidney and pancreas. 

Cellular distribution of XT activity in cultured CHO-K1 cells 

After an incubation period of 3 days cultured CHO-K1 cells were harvested and the XT activity 
15 was determined in the spent culture supernatant and the cell lysates. 92% of the total XT 
activity was found to be located in the cell culture medium (93.1 mU/10 6 cells, SD 9.58), 
whereas only 2% was detected in the cell lysates (2.03 mU/10 6 cells, SD 0.44). 6% of the total 
XT activity (5.82 mU/10 6 cells, SD 2.18) was released from the membrane-containing fractions 
after addition of the detergence Triton X-100 indicating that less than l/10th of XT is bound to 
20 cellular membranes. 

Pharmaceutical and diagnostic use 

The xylosyltransferase enzymes according to the invention can be used directly as therapeutic 
drug in pathological situations where a deficiency of said enzyme and its isoforms can be 
25 detected. As pointed out above XT enzyme may be overexpressed in some diseases and 
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disorders. In these cases inhibitors and / or antagonists of XT-I and or XT-II can be used. Thus, 
the invention relates also to the use for the manufacture of a medicament for the treatment of 
diseases which are caused by increased levels of said enzymes, wherein said medicament is an 
inhibitor or antagonist of xylosyltransferase. 

5 As mentioned above the protein according to this invention can be used as diagnostic means to 
evaluate pathological conditions. As used herein, the term "pharmaceutically acceptable carrier 1 ' 
means an inert, non toxic solid or liquid filler, diluent or encapsulating material, not reacting 
adversely with the active compound or with the patient. Suitable, preferably liquid carriers, axe 
well known in the art. The formulations according to the invention may be administered as unit 

10 doses containing conventional non-toxic pharmaceutically acceptable carriers, diluents, 
adjuvants and vehicles which are typical for parenteral administration. 
The term "parenteral" includes herein subcutaneous, intravenous, intra-articulax and 
intratracheal injection and infusion techniques. Also other administrations such as oral 
administration and topical application are suitable. Parenteral compositions and combinations 

15 are most preferably adminstered intravenously either in a bolus form or as a constant fusion 
according to known procedures. Tablets and capsules for oral administration contain 
conventional excipients such as binding agents, fillers, diluents, tableting agents, lubricants, 
disintegrants, and wetting agents. The tablets may be coated according to methods well known 
in the art. 

20 Unit doses according to the invention may contain daily required amounts of the protein 

according to the invention, or sub-multiples thereof to make up the desired dose. The optimum 
therapeutically acceptable dosage and dose rate for a given patient (mammals, including 
humans) depends on a variety of factors, such as the activity of the specific active material 
employed, the age, body weight, general health, sex, diet, time and route of administration, rate 

25 of clearance, enzyme activity (units/mg protein), the object of the treatment, i. e., therapy or 
prophylaxis and the nature of the disease to be treated. 

Therefore, in compositions and combinations in a treated patient (in vivo) a pharmaceutical 
effective daily dose of the protein of this invention (hXT, hXT-I, hXT-II) is between about 0.01 
and 100 mg/kg body weight (based on a specific activity of 100 kU/mg), preferably between 
30 0.1 and 10 mg/kg body weight. According to the application form one single dose may contain 
between 0.5 and 10 mg of hXT. 
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Short Description of the Figures 
Figure 1: Purification of XT 

Panel A. Heparin affinity chromatography on POROS 20 HE: The dissolved ammonium sulfate 

5 precipitate from JAR choriocarcinoma supernatant was applied to a POROS 20 HE column. 

After equilibration with buffer A, the column was eluted stepwise with NaCl ( ). Protein 

elution was monitored at wavelength A280 ( ), and fractions of 38 ml were assayed for XT 

activity (■ ). The horizontal bracket indicates fractions collected for further purification. 
Panel B> Ion exchange chromatography onPOROS 20 HQ : The desalted and concentrated 

10 product from heparin affinity chromatography was loaded onto a POROS 20 HQ column. The 
column was washed with buffer A. Adsorbed proteins were eluted stepwise with NaCl ( - - ). 

Protein elution was monitored at A280 ( ), and fractions of 50 ml were assayed for XT 

activity (■). The horizontal brackets indicate the fractions collected for further purification. 
Panel C Protamine chloride affinity chromatography: The desalted and concentrated product 

15 from ion exchange chromatography was applied to a protamine chloride POROS column. After 
a washing step with buffer A, the column was eluted stepwise with NaCl 

( ). Elution was monitored at A28O ( X and fractions of 6 ml were assayed for XT 

activity ( ■ ). The horizontal bracket indicates the fractions collected. 
Figure 2 SDS-PAGE of XT fractions at various purification steps 

20 Panel A: XT fractions at various purification steps were subjected to 4-12% 

gradient polyacrylamide gel for SDS-PAGE. Lane I, JAR cell culture supernatant 

(crude material); lane II, ultrafiltration retentate; lane III dissolved ammonium 

sulfate precipitate; lane IV, protein eluted with NaCl from the POROS 20 HE column; lane 

V, protein eluted with NaCl from the POROS 20 HQ column; lane VI protein eluted with NaCl 

25 from the protamine chloride POROS column. Lane M t molecular size standard: myosin 

(200.000), phosphorylase b (97.000), bovine serum albumin (66.000), glutamic dehydrogenase 
(55.000), carbonic anhydrase (31.000), trypsin inhibitor (22.000). The gel was silver-stained. 
Panel B: Collected XT fractions from protamine affinity chromatography were subjected to 
SDS-PAGE on a 4-12% gradient polyacrylamide gel. The arrows indicate the stained bands 

30 corresponding to the unknown protein, which was shown to be XT (120.000) (1), hexose-6- 
phosphate dehydrogenase (89.000) (2), ezrin (68.600) (3), quiescin Q6 (64.000) (4\ 
plasminogen activator (47.000) (5), aldolase A (39.000) (6) and low molecular artefacts (7). All 
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protein bands were excised, enzymatically digested, and the peptide mixture was characterized 
by MALDI-TOF mass spectrometry. The gel was stained with Coomassie Brilliant Blue. 
Figure 3: Immunoaffinity chromatography of xylosyltransferase 

Panel A. 100 jil of the desalted XT-containing fractions eluted from heparin affinity matrix was 
5 applied to a column of immobilized polyclonal antibodies. After washing with buffer D, the 
column was eluted as indicated by the arrow with buffer D / 0.15 M NaCl (1) followed by 12 
mM HC1 (2). Fractions of 1 ml were collected into tubes containing 1 ml of 0.1 M Tris/HCl, pH 

8.0. Protein elution was monitored at 280 nm ( ) and the XT activities (■) of each fraction 

were assayed. 

10 Panel B t Negative control of the immunoaffinity chromatography with immobilized preimmune 
serum. For conditions see Panel A. 

Panel C, Immimoaffinity chromatography with immobilized polyclonal antibodies. The column 
was eluted with buffer D containing 100 \xl (1 mg/ml) of the peptide antigen indicated by the 
arrow. Fractions of 1 ml were collected into tubes containing 1 ml of 0.1 M Tris/HCl, pH 8.0. 

15 Protein elution was monitored at 280 nm ( ) and the XT activities (■) of each fraction 

were assayed. 

Figure 4: Aprotinin affinity chromatography of partially purified XT and 
immunoblot analysis of the separated fractions 

Panel A. 200 fil desalted XT fraction from the heparin purification step was applied to an 
20 aprotinin column previously equilibrated with buffer A. After washing with buffer A, the 

adsorbed proteins were eluted stepwise with NaCl ( ). Protein elution was monitored at 

A280 ( ) and fractions of 2 ml were assayed for XT activity (■). 

Panel B, Aliquots of fraction 3, fraction 7, fraction 12 and fraction 16 were analyzed by western 
blot. The MW 120.000 protein was detected in the XT-containing fraction 7 and 12. 

25 Immunological detections were performed using a polyclonal rabbit antiserum raised against 
the synthetic peptide CSRQKELLKRKLEQQEK deduced from the peptides 2 and 10 of the 
enzymatically cleaved unknown protein. Prestained molecular size standard were myosin 
(190.000), BSA (64.000), glutamic dehydrogenase (51.000). 
Figure 5 : Gel filtration chromatography and N-glycosidase F digestion 

30 Panel A, 100 fil of the XT containing fraction from heparin affinity HPLC was applied to a 
TSK G3000 SW column (30 cm X 7.5 mm, 10 ^im particle size) previously equilibrated with 
buffer A / 0.15 M NaCl. Elution was performed with the same buffer. Fractions of 200 julI were 
collected and assayed for XT activity (■). Protein elution was monitored at A280 ( )- The 
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arrows indicate the elution positions of thyroglobulin (669.000) (1), ferritin (440.000) (2), 
aldolase (158.000) (3), albumin (67.000) (A\ ovalbumin (43.000) f5), chymotrypsinogen A 
(25.000) (6), and ribonuclease A (13.700) 0). 

Panel B. An aliquot of 1 jig of the MW 120.000 protein was analyzed by SDS-PAGE 

5 (I). Aliquots (1 ng) of the MW 120.000 protein were digested with 3,1 x 10 " 3 units of N- 
glycosidase F at 37°C for 1 h (II) and 12 h (EI). The samples were then subjected to SDS- 
PAGE, and protein bands were detected by silver staining. The molecular size standard were 
myosin (200.000), b-galactosidase (116.000), phosphorylase b (97.000). 
Figure 6: Cloning strategy for human XT-I and XT-II isoform. 

10 (A) The peptides 4, 6, 7, and 9 are sequences that were most favorable for the design of 
degenerate PCR primers. The amino acid sequence of the peptides was obtained after 
proteolytic digestion of the purified human XT. The strategy for cloning of XT-I cDNA (B) and 
XT-II isoform (C) is illustrated. The open reading frame of XT-I and XT-II is shown as a filled 
box, and the location of the peptides 4, 6, 7 and 9 is illustrated by open boxes. The location and 

15 orientation of the degenerate primers employed for cloning of XT-I and XT-II are marked by 
arrows. XT-I and XT-II cDNA inserts contained within the indicated plasmids were obtained 
using RT-PCR with degenerate primers (pCGl 11-4, pCGl 14-29, pCGl 10-7), 5' RACE 
(pCG185-21, pCG212-19, pCG319-23, 3' RACE (pCG204-38, pCG21 1-4) and RT-PCR with 
gene-specificprimers(pCG176-l).H = A + C + T, Y = C + T,R = A + G,N = A+G + C + T,I 

20 = deoxyinosine. 

Figure 7 A, B: Nucleotide (A) and deduced amino acid (B) sequence of the hXT-L 
The position of the peptide sequences obtained by digestion of the purified human XT are 
underlined (B). Potential N-glycosylation sites are double underlined. The DxD motif is 
intensified depicted. The figures are identical with SEQ ID 1 and 2. 
25 Figure 8A, B: Nucleotide (A) and amino acid (B) sequence of the hXT-U isoform. 

Potential N-glycosylation sites are double underlined (B). The putative transmembrane domain 
is underlined. The figures are identical with SEQ ID 3 and 4. 

Figure 9; Nucleotide sequence of rXT-I (SEQ ID 5). The corresponding protein sequence is 
depicted in SEQ ID 6. 

30 Figure 10: Nucleotide sequence of rXT-U (SEQ ID 7). The corresponding protein sequence is 
depicted in SEQ ID 8. 

Figure 11 : Differential expression of the XT-I, XT-II gene in human tissues. Semiquantitative 
RT-PCR with normalized first-strand cDNA was used to examine the abundance of XT-I and 
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XT-II transcripts. A 983 bp fragment of the housekeeping gene glyceraldehyde-3 -phosphate 
dehydrogenase cDNA was amplified as control (A). The 490 bp XT-I cDNA fragment (B) and 
the 717 bp fragment amplified from XT-II isofonn cDNA (C) were detected in each tissue 
indicating that both enzymes are ubiquitously expressed. The arrows indicate the expected 
5 position for each PCR product. . 

The following examples describe the invention in more detail without to limit scope of the 
technical teaching. 

Example 1: Materials for Isolation and Purifications. Human JAR choriocarcinoma cells 

10 were purchased from ATCC (Rockville, MD). Dried UltraDOMA-PF medium was obtained 
from BioWhittaker (Vervier, Belgium) and aqua ad injecta from Braun (Melsungen, Germany). 
Heat-inactivated fetal calf serum, Dulbecco's phosphate-buffered saline, antibiotic/antimycotic 
solution, trypsin-EDTA solution, trypan blue, protamine chloride and the Bicinchoninic Acid 
Protein Assay Kit were purchased from Sigma (Deisenhofen, Germany). Cell culture 

15 flasks, serological pipettes, and sterile tubes were purchased from Becton 

Dickinson (Heidelberg, Germany). The hybrid hollow-fiber bioreactor TECNOMOUSE® 
was supplied by Integra Biosciences (Fernwald, Germany), the ACA analyzer by 
Dade Diagnostica (Mtinchen, Germany) and the Super G analyzer by RLT 
(MOhnesee, Germany). UDP-[ 14 C]xylose (9.88 kBq / nmol) came from DuPont (Bad 

20 Homburg, Germany), 25 mm diameter nitrocellulose discs from Sartorius (Gottingen, 

Germany), scintillation mixture and the liquid scintillation counter LS500TD was obtained 
from Beckman Coulter (Fullerton, CA). Ultrafiltration cells, YM1 membranes and PVDF 
membranes (Immobilon P) were purchased from Millipore (Eschborn, Germany). 
The chromatography media POROS 20 HQ, POROS 20 HE2, POROS 20 AL, POROS 20 EP 

25 and the HPLC workstation Biocad Sprint were supplied by Perseptive 

Biosystems (Framingham, MA). The gel filtration column TSK G3000 SW (30 cm x 7.5 mm, 
10 |nm particle size) was obtained from TosoHaas (Montgomeryville, PA). The MALDI mass 
spectrometer Reflex II was from Bruker Daltonik GmbH (Bremen, Germany) and protein 
sequencer Precise 494 cLC was purchased from PE Biosystems (Framingham, MA). Precast 

30 polyacrylamide gels, buffers, and NuPAGE electrophoresis system XCell II Mini-Cell and Blot 
Module were from Novex (San Diego, CA). The synthetic peptide CSRQKE.LLKREXEQQEK 
and the rabbit antiserum were purchased from Bioscience (Gottingen, Germany). Peroxidase- 
conjugated affinipure F(ab')2 fragment goat anti-rabbit IgG (H+L) was purchased from 
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Dianova (Hamburg, Germany). N-glycosidase F was obtained from Roche (Mannheim, 
Germany). 

Example 2: Cell culture. JAR choriocarcinoma cells releasing XT in the cell culture 
supernatant were cultured in Ultradoma-PF medium containing 10% heat-inactivated fetal calf 

5 serum, 100 U/ml penicillin, 100 |ig/ml streptomycin and 250 ng/ml amphotericin B in a 

humidified atmosphere of 5% C02 and 95% air at 37°C. After incubation for 24 h in the serum- 
containing medium, the cell cultivation was adapted to serum-free conditions as described 
previously (6). Scaling up of XT production was carried out in three hybrid hollow-fiber 
bioreactors TECNOMOUSE®. During the exponential growth the cells from three 175 cm 2 T- 

10 flasks (> 3 x 10 7 cells) were detached with 0.5% trypsin and 0.2% EDTA in Dulbecco's 

phosphate-buffered saline by incubation at 37°C for 10 min. After centrifugation (5 min, 1000 x 
g) of the cell suspension the cell pellet was resuspended in 10 ml 37°C warm serum-free and 
protein-free Ultradoma-PF medium and washed three times with 20 ml of the same medium. 
The cell suspension was drawn into a 10 ml syringe and then inoculated into the extracapillary 

15 space (EC space) of the reactor. The bioreactor was connected with a 2 liter medium bottle and 
set to 150 ml/h in the recirculation mode, and the oxygenation pump was set as described in the 
operating manual. Five days after inoculation a 10 ml syringe was connected to the left hand 
EC port and 10 ml of cell culture supernatant was harvested from the EC space. The harvesting 
was continued every two days over a period of 3 months. Glucose and lactate concentration of 

20 the cell culture supernatant were controlled using the Super G analyzer and the ACA analyzer, 
respectively. The 2 liter medium bottle was replaced every three days. The viability of the cells 
was determined by trypan blue exclusion. 

Example 3: Synthesis of the protamine affinity matrix. Protamine chloride was immobilized as 
ligand on POROS 20 AL. 30 mg ligand was dissolved in 10 ml of 10 mM phosphate / 0.15 M 

25 NaCl, pH 7.4. After the protein had been dissolved, 5 ml of 1 00 mM phosphate / 1 .50 M NaCl, 
pH 7.4 was added. NaCNBHh was dissolved in the ligand/buffer solution to a final 
concentration of 5 mg/ml and 1.0 g POROS 20 AL was suspended in the same solution. 
The suspension was mixed gently on a shaker for 1 minute at RT. An additional 2 ml of 100 
mM phosphate / 1 .50 M NaCl, pH 7.4 was added to the suspension and the mixture was shaken 

30 continuously. This step was repeated every five minutes until the mixture volume was 25 ml. 
After additional shaking for 2 h, the medium was filtered on a sintered glass funnel. The matrix 
was suspended in 20 ml of 0.2 M Tris/HCl / 5 g/1 NaCNBH 3 , pH 7.2 and mixed gently on a 
shaker for 30 min at RT. After the media had been washed in a sintered glass funnel using 100 
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ml of 10 mM phosphate, pH 7.4, 100 ml of 1.0 M NaCl and another 100 ml of 10 mM 
phosphate, pH 7.4, the matrix was packed in a PEEK column (4.6 x 50 mm). 
Example 4: Synthesis of the aprotinin affinity matrix. Aprotinin affinity matrix was synthesized 
according to the synthesis of the protamine affinity matrix, but using aprotinin as ligand. After 

5 immobilization of the ligand the matrix was packed in a PEEK column (4.6 x 50 mm). 
Example 5: Purification of xylosyltransferase from JAR cell culture supernatant 
Fractionated ammonium sulfate precipitation and chromatography steps were performed at RT, 
ultrafiltration and diafiltration were carried out at 4°C. 1 8.5 liters JAR cell culture supernatant 
collected from three hybrid hollow-fiber bioreactors, TECNOMOUSE®, each containing 5 

10 culture cassettes, was concentrated to 800 ml with ultrafiltration cells using YM1 cellulose 
membranes. The retentate was centrifiiged at 4,000 x g for 1 h. The supernatant was decanted, 
and the pellet was discarded. 

Step 1: Fractionated ammonium sulfate precipitation - Solid ammonium sulfate was added to 
the supernatant to 28% saturation. After 1 h at RT the suspension was centriftiged at 4,000 x g 

15 for 2 h, the supernatant was decanted, and the precipitate was removed. Additional ammonium 
sulfate was added to the solution to the point of 40% saturation, and the suspension was 
allowed to stand for 1 h. To recover the precipitate the supernatant was decanted after the 
suspension was centrifiiged at 4,000 x g for 2 h. Before chromatography on immobilized 
heparin the precipitate was dissolved in 460 ml buffer A (20 mM sodium acetate, pH 6.0). 

20 Step 2: Heparin affinity chromatography on POROS 20 HE2 - The step 1 product was passed 
through a 0.2 \\m filter. 4.0 ml of the filtrate was applied to a POROS 20 HE2 column (16 x 100 
mm) equilibrated with buffer A at a flow rate of 40 ml/min. After washing the column with 100 
ml of buffer A the XT activity was eluted with the same buffer containing NaCl. The NaCl 
concentration was increased stepwise: 20 ml buffer A / 0.09 M NaCl; 20 ml buffer A / 0.15 M 

25 NaCl; 30 ml buffer A / 0.24 M NaCl; 24 ml buffer A / 0.30 M NaCl; 24 ml buffer A / 0.60 M 
NaCl; 24 ml buffer A / 1 .00 M NaCl; and 24 ml buffer A / 1.89 M NaCl. Fractions of 38 ml 
each were collected and the XT activity was measured. The procedure was repeated 115 times 
by cyclic chromatography and the fractions containing XT activity (1 15 x 38 ml) 
were collected. 

30 Step 3: Ion exchange chromatography on POROS 20 HQ - Collected fractions from step 2 were 
desalted using diafiltration with YM1 cellulose membranes and ultrafiltration cells. After 
concentration of the desalted protein solution to 0.05 liter using analogous techniques the XT- 
enriched solution was subjected to ion exchange chromatography. 4.0 ml of the XT solution 
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was applied onto the POROS 20 HQ column (16 x 100 mm) previously equilibrated with buffer 
A at a flow rate of 40 ml/rnin. The column was washed with 80 ml buffer A, and the adsorbed 
protein was eluted stepwise using the same buffer containing 0.07 M NaCl (88 ml), 0.18 M 
NaCl (120 ml), and 0.36 M NaCl (120 ml) followed by a linear gradient of 0.36-1 .00 M NaCl 
5 (200 ml) and another step of buffer A/2.0 M NaCl (120 ml). 50 ml fractions were collected 
and assayed for activity and evaluated by SDS-PAGE. Chromatography was repeated 13 times, 
and the fractions exhibiting XT activity (13 x 50 ml) were collected for affinity 
chromatography. 

Step 4: Affinity chromatography on protamine chloride - "XT-containing solution from step 3 

10 was desalted as described above and concentrated to 5 ml by ultrafiltration with YM1 cellulose 
membranes. The ultrafiltration product was passed through a 0.2 jam filter. 100 ^il of the filtrate 
was loaded onto a protamine chloride - POROS column (4.6 x 50 mm) equilibrated with buffer 
A. The flow rate was 10 ml/min. The column was washed with 10.0 ml of buffer A, and the 
adsorbed fraction was eluted with the same buffer containing NaCl by a stepwise increase of 

15 the NaCl concentration: 6.6 ml buffer A / 0.04 M NaCl; 6.6 ml buffer A / 0,06 M NaCl; 6.6 
ml buffer A / 0.23 M NaCl followed by a linear gradient of 0.23 - 1 .20 M NaCl (4.2 ml) 
in buffer A. Fractions of 6.0 ml were collected, assayed for XT activity and evaluated by SDS- 
PAGE. Cyclic chromatography was repeated 50 times. The purified enzyme was collected, 
concentrated to 1 .0 ml using ultrafiltration techniques and stored at - 75 °C. 

20 Step 5: SDS-Polyacrylamide gel electrophoresis (SDS-PAGE) - The protein composition of 
various fractions was estimated by SDS-PAGE. Briefly, 12.1 \xl of sample was added to 4.7 |il 
of sample buffer (1.00 M Tris/HCl / 1.17 M sucrose, 0.28 M SDS, 2.08 raM EDTA, 0.88 mM 
Serva Blue G250, 0.70 mM phenol red, 0.10 M DTT, pH 8.5) and heated for 10 minutes at 
99°C. After the sample had been loaded, SDS-polyacrylamide gel electrophoresis was carried 

25 out on a 4-12% bis-tris polyacrylamide gel with 3-(N-morpholino) propanesulfonic acid 

(MOPS) running buffer (1.00 M MOPS/Tris / 69.3 mM SDS, 20.5 mM EDTA, pH 7.7). Protein 
bands were detected by Coomassie Brilliant Blue or by silver staining. The Coomassie bands 
were excised and characterized by MALDI mass spectrometry and amino acid sequence 
analysis. 

30 Example 6: MALDI mass spectrometry. Coomassie-stained proteins were excised from the gel, 
repeatedly washed with H2O and H20/acetonitrile and digested overnight with trypsin and 
endoproteinase Lys-C at 37°C. The peptides generated in the supernatant were analyzed by 
MALDI mass spectrometry. Sample preparation was achieved following the thin film 
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preparation techniques (13). Briefly, aliquots of 0.3 \x\ of a nitrocellulose containing saturated 
solution of a-cyano-4- hydroxycinnamic acid in acetone were deposited onto individual spots 
on the target. Subsequently, 0.8 |il 10% formic acid and 0.4 [il of the digest sample was loaded 
on top of the thin film spots and allowed to dry slowly at ambient temperature. To remove salts 

5 from the digestion buffer the spots were washed with 10% formic acid and with H2O. MALDI 
mass spectra were recorded in the positive ion mode with delayed extraction on a Reflex II 
time-of-flight instrument equipped with a SCOUT multiprobe inlet and a 337 nm nitrogen laser. 
Ion acceleration voltage was set to 20.0 kV, the reflector voltage was set to 21 .5 kV and the 
first extraction plate was set to 15.4 kV. Mass spectra were obtained by averaging 50-200 

10 individual laser shots. Calibration of the spectra was performed internally by a two-point linear 
fit using the autolysis products of trypsin at m/z 842.50 and m/z 221 1.10. 
Example 7: Amino acid sequence analysis of XT. The MW 120.000 Coomassie-stained protein 
was excised from the gel, repeatedly washed with H2O and HtO/acetonitrile and digested with 
trypsin and endoproteinase Lys-C overnight. For HPLC separation the excised gel fragment 

15 was extracted twice with 0. 1% TFA/60% acetonitrile. The extracted enzymatic fragments were 
separated on a capillary HPLC system equipped with a HOB solvents delivery system 
(PE Biosystems), Acurate splitter (LC-Packings), UV absorbance detector 759A 
(PE Biosystems), U-Z capillary flow cell (LC-Packings) and Probot fraction collector 
(BAI) using reversed-phase column (Hypersil CI 8 BDS, 3 \xm, 0.3 x 150 mm) and a linear 

20 gradient from 12% acetonitrile, 0.1% TFA to 64% acetonitrile, 0.08% TFA in 90 min with a 
flow rate of 4 jil/min at RT. Peptide elution was monitored at 214 nm and individual fractions 
from the HPLC separation were analysed by MALDI mass spectrometry. Sequence analysis 
of separated fragments was performed on a Procise Protein Sequencer 494 cLC using standard 
programs supplied by PE Biosystems. 

25 Example 8: Determination of XT activity. Determination of XT activity is based on the 
incorporation of [ 14 C]-D-xylose into the recombinant bikunin according to a previously 
described method (Brinkmann et al., 1997, J. Biol. Chem., 272, 11171-11175). For analysis of 
the substrate specificity of the recombinant xylosyltransferases synthetic peptides containing 
the XT recognition sequence were used as acceptor. The reaction mixture for the assay 

30 contained, in a total volume of 100 fil: 50 |il of XT solution, 25 mM 
4-morpholineethanesulfonic acid (pH 6.5), 25 mM KC1, 5 mM KF, 5 mM MgCk, 5 mM 
MnCb, 1.0 |iM UDP-[ ,4 C]-D-xylose, and 1.5 |iM of the synthetic peptides. After incubation for 
75 min at 37°C, the reaction mixtures were placed on discs (25 mm diameter) of Immobilori- 
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AV membrane, which immobilizes even small peptides by covalent links (Pfund & Bourdage, 
1990, Mol Immunol 27, 495-502), and allowed to dry. It was then washed for 10 min with 
10% trichloroacetic acid and three times with 5% trichloroacetic acid solution. Incorporated 
radioactivity was determined by liquid scintillation counting. 

5 Example 9: Antiserum Preparation. The synthetic peptide CSRQKELLKRKLEQQEK 
deduced from the sequenced peptides 2 and 10 of the enzymatically cleaved XT was 
synthesized, purified by HPLC and used for immunization of rabbits (Bioscience, Germany). 
Polyclonal antiserum was obtained by injection of the above antigen followed Example 10: 
Preparation of solid-phase antigen The antigen CSRQKELLKRKLEQQEK was 

10 immobilized on POROS 20 EP. After 1.6 mg antigen was dissolved in 1.2 ml of 10 mM 
phosphate / 0.15 M NaCl, pH 7.4 0.60 ml of 100 mM phosphate / 1.50 M NaCl, pH 7.4 was 
added. 400 mg POROS 20 EP was suspended in the solution and the suspension was mixed 
gently on a shaker at RT. At 10-min intervals, five times in total, an additional 0.24 ml of 100 
mM phosphate / 1.50 M NaCl, pH 7.4 was added to the suspension. After additional shaking for 

15 5 days at RT the suspension was filtered on a sintered glass funnel. The matrix was suspended 
in 4 ml 0.2 M phosphate / 0.1 M 2-mercaptoethanol, pH 7.4 and mixed on a shaker for 2 h at 
RT. The matrix was washed in a sintered glass funnel using 20 ml of 10 mM phosphate / 0.15 
M NaCl, pH 7.4 and 20 ml of 1.0 M NaCl. After additional washing with 20 ml of 10 mM 
phosphate, pH 7.4, the matrix was packed in a PEEK column (4.6 x 50 mm). 

20 Example 11: Antibody purification. Antiserum was adjusted to 50 mM Tris/HCl, pH 8.0. 
The solution was clarified by passage through a 0.2 \xm filter. 0.4 ml filtrate was applied at 10 
ml/min to the antigen column previously equilibrated with buffer B (50 mM Tris/HCl, pH 8.0). 
After the column was washed with 4. 1 ml buffer B and with 12.5 ml buffer B / 0. 1 5 M 
NaCl, the adsorbed antibody was eluted using 12.4 ml of 50 mM sodium citrate / 0.15 M NaCl, 

25 pH 3.0 followed by 3.4 ml of 100 mM sodium citrate / 1.5 M NaCl, pH 3.0. The eluate was 
collected as 10 ml fractions in tubes containing 2 ml of 0.5 M Tris/HCl, pH 8.0, to immediately 
neutralize the citric acid. 

Example 12: Preparation of immunoaffinity column. Purified antibody was concentrated 
to a protein concentration of 0.3 mg/ml using ultrafiltration with YM1 cellulose membranes. 
30 The antibody solution was adjusted to 1 0 mM phosphate / 0. 1 5 M NaCl, pH 7.4. After filtration 
of the solution through a 0.2 ^m filter 100 jxl filtrate was applied at 0.2 ml/min to a POROS 20 
PA column (2.1 x 30 mm). This step was repeated 17 times. Adsorbed antibody was cross- 
linked using cross-linking solution (100 mM triethanolamine, pH 8.5). After the column was 
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washed with 5 ml of buffer C (10 mM phosphate / 0.15 M NaCl, pH 7.4) 2 ml cross-linking 
solution was applied at 0.5 ml/min onto the cartridge. The procedure was repeated 6 more 
times, using a total volume of 14 ml cross-linking solution. To block unreacted functional 
groups on the cross-linking reagents 2 ml of 100 mM monoethanolamine, pH 9.0 

5 (quenching solution), was loaded onto the cartridge at 0.5 ml/min. The column was 
washed using 2 ml of buffer C and the cross-linking step was repeated using another 2 
ml quenching solution. The immunoaffinity column was cycled between buffer C and 12 mM 
HQ / 0. 1 5 M NaCl 3 times using a total volume of 12 ml .of solution. 
Example 13: Immunoaffinity column purification of XT. XT-containing fractions eluted 

10 from the heparin affinity matrix were desalted using diafiltration with YM1 cellulose 

membranes and passed through a 0.2 pm filter. 100 pi of this XT-sample was applied to the 
immunoaffinity column equilibrated with buffer D (20 mM Tris/HCl, pH 8.0) at a flow rate of 1 
ml/min. The column was washed with 1.4 ml of buffer D and with 8.5 ml of buffer D / 0.15 M 
NaCl. The XT activity was eluted with 4.2 ml of 12 mM HC1 followed by 1 .2 ml of 12 mM 

1 5 HC1 / 1.5 M NaCl. Alternatively the elution was performed using 1 00 pi of antigen at 1 mg/ml 
in buffer D. Fractions (1 ml) were collected into tubes containing 1 ml of 0.1 M Tris/HCl, 
pH 8.0. The XT activities of the fractions were determined. 

Example 14: Aprotinin affinity chromatography. 200 pi of desalted XT solution from the 
heparin purification step was applied at 10 ml/min to the aprotinin column previously 

20 equilibrated with buffer A. After washing the column with 6.6 ml of buffer A the adsorbed ' 
protein was eluted stepwise using the same buffer containing 0.3 M NaCl (10.0 ml), 0.54 M 
NaCl (1 0.0 ml), 1 .00 M NaCl (10.0 ml) and 1 .50 M NaCl (2.4 ml). Example 15: Western 
Blot Analysis. For western blot analysis, proteins were transferred to polyvinylidene 
difluoride membrane in a semi-dry instrument (Novex). After transfer nonspecific 

25 antibody binding sites were blocked with 2% BSA in 0.1 M Tris/HCl, pH 7.2, for 1 h at RT. 

The membrane was incubated with antiserum in 50 mM phosphate / 0.15 M NaCl, 0.5 

t 

ml/1 Tween 20, pH 7.4 at 1 : 1000 dilution for 1 h. Bound antibody was detected using a second 
anti-rabbit goat immunoglobulin coupled to horse-radish peroxidase at a 1:1000 dilution. The 
blot was developed using 4-chloro-l-naphthol. 
30 Example 16: Gel filtration chromatography. A sample of 100 pi from the heparin 

purification step was applied at 1.0 ml/min to a TSK G3000 SW column (30 cm x 7.5 mm, 10 
|im particle size) which had previously been equilibrated with buffer A / 0.15 M NaCl. Proteins 
were eluted with the same buffer. Fractions of 200 pi were collected and tested for XT activity. 
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Column calibration was performed using thyroglobulin (669.000), ferritin (440.000), 
aldolase (158.000), albumin (67.000), ovalbumin (43.000). 

Example 17: N-glycosidase F digestion. Aliquots (1 \ig) of XT were digested with 3,1 x 
10 " 3 units of N-glycosidase F at 37°C for 1 h and 12 h (Table II). The samples were then 
5 subjected to SDS-PAGE, and protein bands were detected by silver staining. 

Example 18: Measurement of protein concentration. Protein concentration was estimated 
by absorbance at 280 nm assuming E 1% i cm = 10.0 or with the Bicinchoninic Acid Protein 
Assay using bovine serum albumin as a standard. 

Example 19: PCR-based cloning of human xylosyltransf erase. Degenerate oligonucleotide 

10 primers with deoxyinosine substitution were designed based upon the amino acid sequence of 
peptides obtained after digestion of the isolated human XT with trypsin or Lys-C. The first 
strand of cDNA was synthesized by the reverse transcription reaction using poly( A) + RNA 
isolated from the chondrosarcoma cell line SW1353 as template and oligo(dT) as primer. The 
reverse transcription reaction was performed at 37°C for 2 h using 50 pmol of oligo(dT) 

15 primers, 1 \xg of poly(A) + RNA, a 0.5 mM concentration of each dNTP, 1.x RT buffer and 200 
units of RNase H deficient Moloney murine leukemia virus reverse transcriptase (Life 
Technologies; Eggenstein, Germany) in a final volume of 20 jxl. For PCR amplification the 
reaction mixture contained 4 |il of the reverse transcription reaction solution, 50 pmol of each 
primer, a 0.25 mM concentration of each dNTP, 50 mM KC1, 10 mM Tris-HCl (pH 8.3), L5 

20 mM MgCk and 2.5 units of hot start Taq polymerase (Life Technologies) in a final volume of 
50 jliI. Amplification with degenerate oligonucleotide primers was carried out by 40 cycles at 
94°C for 1 min, 48°C for 1 min, and 72°C for 2 min, followed by a final extension step at 72°C 
for 15 min. After agarose electrophoresis of the PCR products, DNA fragments were excised, 
subcloned into the pCR2.1 vector (Invitrogen. Groningen, Netherlands) and sequenced by the 

25 dideoxy chain termination method using Taq DNA polymerase (Big-dye terminator cycle 

sequencing kit, Perkin-Elmer, Norwalk, CT, USA) with an automated DNA sequencer (Applied 
Biosystems, Weiterstadt, Germany). Several clones were sequenced to compensate for 
misreading by Taq polymerase. 

Example 20: Rapid amplification of 5 ' and 3 ' cDNA ends (RACE). For amplification of the 5' 
30 and 3' ends of the XT-I and XT-II isoform cDNA RACE experiments were performed using 
commercially available systems (Clontech, Heidelberg, Germany; Life Technologies) according 
to the manufacturers' instructions. For 3 '-RACE 1 |ig placenta poly(A) + mRNA (Clontech) was 
reverse transcribed with a 3 '-CDS primer (Clontech). PCR amplification of the 3' cDNA end of 
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human XT-I was accomplished according to a touch-down PCR protocol (5 cycles: 94°C for 30 
s, 72°C for 3 min, 5 cycles: 94°C for 30 s, 70°C for 45 s, 72°C for 3 min, 25 cycles: 94°C for 30 
s, 65°C for 45 s, 72°C for 3 min) with the gene-specific primer GSPlb 5 s - 
GTGGGTATGC AG AAGTGGGGGAAGGG AC-3 9 and the UPM primer mix (Clontech). An 

5 aliquot of the first PCR was subjected to semi-nested PCR (30 cycles) using the primer 
Con21 1 1 5'-CCCTCCGCAATGCCTACA-3' and the UPM primer mix. The DNA fragments 
obtained were subcloned into the vector pCR2.1 (Invitrogen) and sequenced. PCR amplification 
of the 3' cDNA end of the XT-II isoform was carried out with the primers AB 10267 5'- 
ACTGAGGTCACGC AATAC AA-3 * and UPM using the touch-down PCR protocol. For 5'-RACE 

10 1 \xg of placenta poly(A) + mRNA was subjected to a reverse transcription and 5' tailing reaction 
with the 5'-CDS primer and the SMART oligo (Clontech) according to the manufacturer's 
protocol. The 5' end of the XT-I cDNA was amplified using the primer GSP_776L 5 9 - 
GCCGCACTCAG-GTGATGAAGAAGT-3 9 and UPM with a touch-down PCR protocol. An 
aliquot was used as template in a second semi-nested PCR reaction with the primers GSP_503L 

15 S'-ACCA-CCAGGACAAAGGCGATTCTGA-3' and UPM. The 5'-cDNA end of XT-II was 

obtained by 5'-RACE amplification using the primers AB10315L 5 ' -AGTCGAAC AGTCC AGG- 
GCCC-3' and UPM mix. A new primer for a 5 '-RACE reaction was designed based upon the 
nucleotide sequence of the largest fragment obtained (2 kbp), and the experiment was repeated 
with the primer AB5846L 5 5 -C ACGATCTCGCACTTGGGGG-3 ' as described above. The 

20 nucleotide sequences of human XT-I and XT-II cDNA have been submitted to the 
GenBank/EBI Data Bank with the accession numbers AJ277441 and AJ277442. 
Example 21: Isolation ofXT-IcDNA from human brain and chondrocyte cDNA libraries. 
Plasmid-DNA was isolated from a human whole brain cDNA library (Life Technologies) and a 
human chondrocyte cDNA library (Clontech) and used as template in a PCR-based approach 

25 for isolation of the 5' end of the XT-I cDNA. Briefly, GSPJ776L and 5'ADLD 5'- 

CTATTCGATG ATGAAGAT ACCCCACC AAACCC-3 ' primers were used in a PCR reaction with 
I |!g of plasmid DNA as template. The DNA fragments were subjected to agarose gel 
electrophoresis and 0.7-3 kb long fragments were excised from the gel and used as a template in 
a semi-nested PCR reaction with GSP_503L and S'-ADLD primers. DNA fragments were then 

30 subcloned into the vector pCR2. 1 and sequenced. 

Example 22: Cloning of rat XT-I and XT-II cDNA. A RT-PCR based approach using primers 
based upon the nucleotide sequence of human XT-I and XT-II was employed for amplification 
of XT-I and XT-II cDNA from rat. The first strand of cDNA was synthesized by the reverse 
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transcription reaction using poly(A) + RNA isolated from the rat liver cell line BRL3A as 
template and oligo(dT) as primer. DNA fragments obtained after PCR amplification using 
moderate stringent conditions were subcloned into the vector pCR2.1 and sequenced. The 5' 
and 3' ends of the XT-I and XT-II cDNA from rat tissue were amplified using the RACE 
5 strategy with gene-specific primers as described above. The nucleotide sequences of rat XT-I 
and XT-II cDNA have been submitted to the GenBank/EBI Data Bank with the accession 
numbers AJ295748 and AJ295749. 

Example 23: Expression levels of the XT-I and the XT-II isoform in human tissues. 

A Human Multiple Tissue cDNA Panel (Cfontech) was used for the analysis of expression 

10 levels. Levels of amplification of the housekeeping gene glyceraldehyde-3 -phosphate 

dehydrogenase, whose transcript is always present in the tissues at an almost constant level, 
were determined in parallel for quality control. For amplification of the XT-I encoding 
transcript the primers 128U and 601L were used, whereas primers AB10267U and AB12394L 
5 '-GG AAGAGCTGGGTGTGGAAT-3 ' were employed for the XT-II. PCR reactions were carried 

15 out by 22-36 cycles at 94°C for 30 s, 60-65°C for 45 s and 72°C for 2 min. Amplification of a 
transcript was performed using a serial number of cycles to find the conditions for 
semiquantitative amplification, and aliquots were analyzed by agarose gel electrophoresis. 
Example 24: Transfection and transient expression of XT. For construction of a eukaryotic 
expression vector a DNA fragment including the known coding sequence of XT-I cDNA was 

20 amplified by PCR using XT_ExplL 5 '-TTTCCCGTTGAGATCCTGCT-3 1 and XTJExp3U 5'- 
ACAGACAGCAACAACGAGAA-3' as primers and placenta first-strand cDNA (Cldntech) as 
template. The 2450 bp fragment obtained was cloned into the vector pcDNA4/HisMax-TOPO 
(Invitrogen) resulting in the fusion of XT-I to the Xpress epitope. The plasmid pCG227-XT-I 
was then transiently transfected into CHO-K1 cells. The coding region of the XT-II cDNA was 

25 amplified by PCR using the primers AB_Exp9U 5 '-AAAGGAAGGCAGAGGAAGC-3 ' and 
AB_Exp3L 5'-ACCCCTCCACTGT-CTGTAAG-3' and placenta first-strand cDNA as template. 
The obtained 2440 bp DNA fragment was cloned into the expression vector pcDNA4/HisMax- 
TOPO resulting in the fusion of XT-II to the aminoterminal Xpress epitope. The plasmid 
termed pCG226-XT-II was transiently transfected into CHO-K1 cells. 2x1 0 5 cells precultured 

30 for 1 day in a 35 mm diameter cell culture dish were transfected with 2 jig of plasmid DNA and 
6 ^1 of Fugene 6 transfection reagent (Roche, Mannheim, Germany). For determination of 
transfection efficiency CHO-K1 cells were transfected with 2 ^g of the control plasmid 
pcDNA4/HisMax-TOPO-/acZ. 48 h after transfection the cell culture medium was harvested. 
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Protein G agarose beads (Sigma, Deisenhofen, Germany) and mouse znti-Xpress monoclonal 
antibody (Invitrogen) were added to the cell culture supernatant and incubated at 4°C for 1 h. 
After centrifugation at 10.000 g for 1 min the absorbed proteins were twice washed with PBS 
and resuspended in a final volume of 50 \xL XT activity was then assayed in the samples. 

5 Example 25: Characterization of the reaction products. For characterization of the reaction 
product of recombinant XT-I the peptide QEEEGSGGGQK, which is homologous to the amino 
terminus of bikunin (Brinkmann et al, 1997, I.e.), was used as acceptor in the XT activity 
assay. After incubation for 75 min at 37°C the enzyme was heat-inactivated by incubation for 
15 min at 65°C and the reaction mixture was used for a- "and P-xylosidase treatment and 

10 alkaline p-elimination. For the linkage-specific digestion of the bound [ l4 C]-D-xylose 4 mU of 
ct-xylosidase (Seikagaku Corporation, Tokyo, Japan) or 4 mU of p-xylosidase (Sigma, 
Dreieich, Germany) were added to the samples and incubated for 60 min at 37°C. The reaction 
mixtures were then placed on Immobilon-AV membrane discs and allowed to dry. The discs 
were washed with trichloroacetic acid as described above and the remaining incorporated 

15 radioactivity was determined against appropriate controls. The alkaline cleavage of the O- 
glycosidic linkage between the [ l4 C]-D-xylose and the p-hydroxyamino acid serine was 
performed as described elsewhere (Montreuil et al, 1994). Briefly, the reaction mixture was 
adjusted to pH 10 with diluted NaOH and an equal volume of cold, freshly prepared 
sodiumboro-hydride solution (2 M sodiumborohydride in 0.1 M NaOH) and was added. After 

20 incubation at 45°C for 16 h the cooled solution was neutralized by adding 50% acetic acid and 
placed on Immobilon-AV membrane discs. After drying the discs were washed with 
trichloroacetic acid and the remaining radioactivity was measured by liquid scintillation 
counting. 
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Patent Claims 

1 . An isofoim of UDP-D-xylose:proteoglycan core protein JS~D-xylosyltransferase(XT). 

5 2. A protein comprising an amino acid sequence of the isoform of claim 1 or a portion thereof, 
having the biological activity of XT. 

3. A protein of claim 1 or 2 deriving from human or rat sources (hXT, rXT) 

10 4. An isolated protein according to claim 3, wherein said hXT has a molecular weight of 
120.000 under SDS polyacrlylamide gel electrophoresis conditions. 

5. A recombinant protein of claim 3, wherein said protein is hXT-I comprising at least 827 
amino acids. 

15 

6. A protein according claim 5, having the amino acid sequence-as depicted in Fig. 7B. 

7. A protein according to claim 3, wherein said protein is hXT-II comprising 865 amino acids 
and is an isoform of hXT-I. 

20 

8. A protein according to claim 7 exhibiting approximately 55% overall sequence identity to 
hXT-I 

9. A protein according claim 8, having the amino acid sequence as depicted in Fig. 8B. 

25 

1 0. A DNA sequence coding for a protein of any of the claims 1-9. 

1 1 . A DNA sequence according to claim 10 comprising the nucleotide sequence coding for 
hXT-I as depicted in Fig. 7A, or rXT-I as depicted in Fig. 9. 

30 

12. A DNA sequence according to claim 10 comprising the nucleotide sequence coding for 
hXT-II as depicted in Fig. 8 A or rXT-II as depicted in Fig. 10. 
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13. An expression vector comprising a promoter sequence, a DNA sequence of claim 10 and 
optionally a signal sequence. 

14. An expression host cell comprising a vector of claim 13, said host cell being capable of 
expressing a protein of any of the claims 1-9. 

15. An antibody directed against a protein as defined in any of the claims 1-9. 

16. A process for isolating and purifying a protein as defined in claims 1-4, characterized by 
the following steps: 

(i) culturing cells having an increased level of XT, and harvesting the 
supernatant of said cell culture, 

(ii) fractionated ammonium sulfate precipitation of the supernatant of step (i), 

(iii) heparin affinity chromatography of the precipitate of step (ii), 

(iv) ion exchange chromatography of the step (iii) product, 

(v) affinity chromatography of the step (iv) product, and 

(vi) SDS-Polyacrylamide gel electrophoresis of step (v). 

17. A pharmaceutical composition comprising a protein of any of the claims 1-9 and a 
pharmacologically acceptable carrier, diluent or excipient. 

1 8. Use of a protein of any of the claims 1 - 9 for the manufacture of a medicament for the 
treatment of sclerotic diseases and chronic inflammatory joint diseases. 

19. Use of claim 18, wherein said medicament is an inhibitor or antagonist of said protein. 

20. Use of a protein of any of the claims 1 - 9 as diagnostic marker. 

21 . Use of a DNA molecule of any of the claims 10 - 12 as gene marker. 

22. Use of an antibody as defined in claim 15 in an immunological assay for determination of 
protein having the biological activity of hXT as diagnostic tool. 
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Fig. 3 
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Fig. 4 
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Fig. 6 

® 



SPPS1 
Peptide 4 
Inv2b 
Inv2c 



5'-ATH GCI AGY CCI CCI AGY GA-3' 

He Ala Ser Pro Pro Ser Asp Phe Gly Arg 
5* -TAD CGN AG I GGN GGI AGI CT-3' 
5»-TAD CGN TCR GGI GGI TCR CT-3' 



DF1 5 1 -GAY TTY GAR AAY GTI GAY AA-3' 

Peptide 6 Asp Phe Glu Asn Val Asp Asn Ser Asn Phe Ala Pro Arg 

PFF-sense 5' -CCI ACI TTY TTY GCN CG-3' 
Peptide 7 Pro Thr Phe Phe Ala Arg 

Peptide 9 Glu Leu Gly Ala Val Lys Pro Asp Gly Arg Leu 

Lysc-invl 3'-CCN CGI CAN TTY GGN CTR CC-5 1 



® 



DF1 

0 



Inv2b/lnv2c 
**- 

PFF-sense SPPS1 Lyso-inv1 
+ -► 

0 0S 



3726 



Peptide | 



XT-I cDNA 

pCG111-4 
PCG114-29 
PCG176-1 
PCG204-38 
pCG1 85-21 



© 



PFF-sense 

0 



Lysc-inv1 



3608 



Peptide | 



XT-ll cDNA 

pCG110-7 
PCG211-4 
PCG212-19 
pCG319-23 
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Fig. 7A 

ACTCAGGATGGCTACT TT T CTC AT CGGCCGAAAGAGAAAGT GCGAACAGACAGCAACAACGAGAACTCT 

GTCCCCAAAGACTTTGAGAATGT GGACAAC AGCAACTT CGCACCCAGGACT CAAAAGCAGAAGCACCAG 

CCTGAGTTGGCGAAGAAGCCACCGAGTAGACAGAAGGAGCTTTTGAAAAGGAAGCTGGAACAGCAGGAG 

AAAGGAAAAGGACATACATTCCCTGGGAAAGGCCCCGGTGAGGTGCTGCCTCCCGGGGACAGAGCCGCA 

GCCAACAGCAGCCACGGGAAGGATGTGTCCAGACCGCCTCATGCCAGGAAAACTGGGGGCAGCTCCCCC 

GAGACCAAGTATGACCAGCCCCCTAAGTGTGACATCTCAGGCAAGGAGGCCATCTCTGCCCTGTCCCGT 

GCTAAGTCCAAGCACTGCCGCCAGGAGATTGGGGAGACTTACTGCCGCCACAAGTTAGGGCTGCTGATG 

CCTGAGAAGGTGACTCGGTTCTGCCCCCTCGAGGGTAAAGCCAACAAGAACGTGCAGTGGGACGAGGAC 

TCCGTGGAGTACATGCCAGCCAACCCGGTCAGAATCGCCTTTGTCCTGGTGGTCCACGGCCGTGCCTCT 

CGGCAGTTGC AGCGCAT GT TC AAGGC CATCT ACCACAAAGACCAC TT CTACT AC AT CCACGT GGACAAG 

CGCTCTAATTACCTGCATCGGCAAGTGCTCCAGGTCTCCAGGCAGTACAGCAATGTCCGCGTCACCCCC 

TGGAGAATGGCCACCATCTGGGGAGGAGCCAGCCTCCTGTCCACCTACCTGCAGAGCATGCGGGACCTC 

CTGGAGATGACCGACTGGCCCTGGGACTTCTTCATCAACCTGAGTGCGGCCGACTACCCCATCAGGACA 

AATGACCAGTTGGTGGCGTTTCTCTCCCGATACCGAGATATGAATTTCTTGAAGTCACACGGCCGGGAC 

AATGCAAGGTTCATTCGGAAGCAGGGCCTGGATCGGCTCTTCCTGGAGTGCGACGCTCACATGTGGCGC 

CTGGGAGATCGGCGGATCCCAGAGGGCATTGCCGTGGATGGCGGTTCGGACTGGTTCCTGCTGAACCGG 

AGGTTTGTAGAATATGTGACCTTCTCCACAGACGATCTGGTGACCAAGATGAAACAGTTCTACTCCTAC 

ACCCTGCTTCCTGCTGAGTCCTTCTTCCATACGGTCCTGGAGAACAGCCCCCACTGCGACACCATGGTG 

GACAACAACCTGCGCATCACCAAC TGGAATCGCAAGCTGGGC TGCAAG TGC CAG TACAAGCACAT CGTG 

GACTGGTGCGGCTGCTCCCCCAATGACTTCAAGCCGCAGGACTTCCACCGCTTCCAGCAGACAGCCCGG 

CCTACCTTCTTTGCCCGCAAGTTTGAAGCCGTGGTGAATCAGGAAATCATTGGGCAGCTGGACTATTAC 

CTGTACGGGAACTACCCTGCAGGTACCCCGGGCCTGCGCTCCTACTGGGAGAATGTCTACGATGAGCCT 

GACGGCATCCACAGCCTGAGCGACGTGACACTCACCTTGTACCACTCCTTTGCCCGCCTGGGTCTTCGA 

CGGGCTGAGACGTCCCTGCACACGGATGGGGAGAACAGCTGCCGATACTACCCAATGGGCCACCCAGCA 

TCTGTGCACCTCTACTTCCTTGCTGACCGCTTCCAGGGCTTTCTGATCAAGCATCATGCTACCAATCTG 

GCTGTGAGCAAACTAGAGACTCTGGAGACCTGGGTGATGCCGAAAAAAGTCTTCAAGATCGCAAGCCCA 

CCCAGTGACTTTGGGAGGCTTCAGTTTTCCGAGGTCGGCACTGACTGGGATGCCAAGGAGAGGCTATTC 

CGCAACTTTGGGGGTCTTCTGGGGCCCATGGATGAGCCGGTGGGTATGCAGAAGTGGGGGAAGGGACCT 

AATGTGACCGTGACCGTCATTTGGGTGGATCCCGTCAATGTCATCGCAGCCACCTACGACATCCTCATT 

GAGTCCACTGCCGAATTCACACACTACAAGCCCCCTTTGAACTTGCCCCTGAGGCCTGGGGTCTGGACA 

GTGAAAATTCTCCACCACTGGGTGCCAGTTGCAGAGACCAAATTCCTCGTTGCGCCTCTGACCTTCTCG 

AACAGGCAGCCCATCAAACCTGAGGAGGCACTGAAGCTGCACAATGGGCCCCTCCGCAATGCCTACATG 

GAGCAGAGCTTCCAGAGCCTAAACCCCGTCCTCAGCCTGCCCATCAACCCCGCCGAGGTGGAACAGGCA 

CGGAGGAACGCAGCCTCCACGGGCACAGCGCTGGAGGGATGGCTGGACTCGTTGGTGGGCGGGATGTGG 

ACTGCCATGGACATCTGTGCCACGGGCCCCACAGCCTGCCCGGTCATGCAGACCTGCAGCCAGACGGCC 

TGGAGCTCCTTCAGCCCTGACCCCAAGTCGGAGCTGGGTGCCGTCAAACCTGATGGCCGGCTCAGGTAG 

CACTGGGCACGAGGAGTGGGCCACAGCAGGATCTCAACGGGAAAGCAGCCAGAGGGGTTGTGGGGCCTG 

AACCCCGGCCTCCACCCTGGGGGAGGCCCTCTGTGAATGGGTCTCTCCTGGCCATAGAATGATGGAAAG 

GGAAGGTCAGCAGGTCAAAGCAGGATCAGCCAACAACCTGCCTTTGGCAAGCTGCGGGTGGGATGGCTC 

AGTCCCTGCACTGTGACTGTCTCACCTCTTCTGSTTGATCCTCAAGTCCTACAGGTTCCTTGTCTTCCC 

CTTCCAGTGACCCACCCCTGACCCCAGACGTGTGATTTTCAGACTTTTCTTTCGAGCAGCAGAACTTCG 

TTTACGGAGCACAGTCATAAGTGRAGGTTCAGGGTGCTGACGAAATCCAAGCTGCTCTGGTTGAAGCTG 

ACAAGTGCGAGGTTCCCTCCCAAAGCTCAGCCCTCTGGGCGGTCCCCTTGCCCAGGGTATCTCCTACGG 

TACCTCTTCAGAACCCAAGGGCTCTGCAAATGCCAGTTTGACAAGCACTGCCCAGACCAACCATGGGTT 

CAAACTCCAGCCCTGCCCTTTGGTTCATTTTTCTGCTTCTCTTGGCTGGGGGACTCTGGTGCCAGCCTT 

GAAAGTCATGGTCGTGGGCCCTTTCCCATGGAGGCTGCAGCCTTAGGAGAGCTCTGAGCCTCTCAGCAG 

CCCTCCTTGGGTTGAACTATTCTCCTTAGTAACTAGGTAAGTGGGAAAGCCTTTTGATGTGGCATGGCC 

AAGGTCCAGCCACAAGTGCAACTGCCACCTGTCCAGGGGTCTGGGCCTCCTTCCCTCAAGGCTGCCACA 

CAAAGTAGC AGAAAT AGGAT GATGTTTGTGAGCACCAGACT CAAGAC CATGAC CT TCT TT GATCC TT GA 

AAAT GGGAACT T TGAC AGCC ATGACCATGAAACTCACAAGGCAACGC GGAT GAAACT C ACAAAGCAATG 

CTTGGAGCAAAACTCCTGAGCTAGACAGCACAGCAGCACCCATCCCCTGCCAGAGCCCTTCCGTTTGAG 

GTCAGACACACAAAACCT TCGTCAAT TGCACAC CGGTGCTGT TGGGAGT GACC AAACCACATGAACC AG 

ACTTTTCCCGTCCAGGAAATAGCATTTCAGATTTGGTTTTTAATTTCATGCCCTTCGGCCACAGGCTCA 

ACGGGACATGCAACATAAAAATGGGAAGGTTATTCAACCCGCGGGAAAAAAAAAAAAAAAAAAAAAAAA 
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Fig. 7B 

Thr Gin Asp Gly Tyr Phe Ser His Arg Pro Lys Glu Lys Val Arg Thr 
Asp Ser Asn Asn Glu Asn Ser Val Pro Lys Asp Phe G lu Asn Val Asp 
Asn Ser Asn Phe Ala Pro Arg Thr Gin Lys Gin Lys His Gin Pro Glu 
Leu Ala Lys Lys Pro Pro Ser Arg Gin Lys Glu Leu Leu Lys Arg Lys 
Leu Glu Gin Gin Glu Lys Gly Lys Gly His Thr Phe Pro Gly Lys Gly 
Pro Gly Glu Val Leu Pro Pro Gly Asp Arg Ala Ala Ala Asn Ser Ser 
His Gly Lys Asp Val Ser Arg Pro Pro His Ala Arg Lys Thr Gly Gly 
Ser Ser Pro Glu Thr Lys Tyr Asp Gin Pro Pro Lys Cys Asp He Ser 
Gly Lys Glu Ala He Ser Ala Leu Ser Arg Ala Lys Ser Lys His Cys 
Arg Gin Glu He Gly Glu Thr Tyr Cys Arg His Lys Leu Gly L eu Leu 
Met Pro Glu Lys Val Thr Arg Phe Cys Pro Leu Glu Gly Lys Ala Asn 
Lys Asn Val Gin Trp Asp Glu Asp Ser Val Glu Tyr Met Pro Ala Asn 
Pro Val Arg He Ala Phe Val Leu Val Val His Gly Arg Ala Ser Arg 
Gin Leu Gin Arg Met Phe Lys Ala He Tyr His Lys Asp His Phe Tyr 
Tyr He His Val Asp Lys Arg Ser Asn Tyr Leu His Arg Gin Val Leu 
Gin Val Ser Arg Gin Tyr Ser Asn Val Arg Val Thr Pro Trp Arg Met 
Ala Thr He Trp Gly Gly Ala Ser Leu Leu Ser Thr Tyr Leu Gin Ser 
Met Arg Asp Leu Leu Glu Met Thr Asp Trp Pro Trp Asp Phe Phe He 
Asn Leu Ser Ala Ala Asp Tyr Pro He Arg Thr Asn Asp Gin Leu Val 
Ala Phe Leu Ser Arg Tyr Arg Asp Met Asn Phe Leu Lys Ser His Gly 
Arg Asp Asn Ala Arg Phe He Arg Lys Gin Gly Leu Asp Arg Leu Phe 
Leu Glu Cys Asp Ala His Met Trp Arg Leu Gly Asp Arg Arg He Pro 
Glu Gly He Ala Val Asp Gly Gly Ser Asp Trp Phe Leu Leu Asn Arg 
Arg Phe Val Glu Tyr Val Thr Phe Ser Thr Asp Asp Leu Val Thr Lys 
Met Lys Gin Phe Tyr Ser Tyr Thr Leu Leu Pro Ala Glu Ser Phe Phe 
His Thr Val Leu Glu Asn Ser Pro His Cys Asp Thr Met Val Asp Asn 
Asn Leu Arg He Thr Asn Trp Asn Arg Lys Leu Gly Cys Lys Cys Gin 
Tyr Lys His He Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys 
Pro Gin Asp Phe His Arg Phe Gin Gin Thr Ala Arg Pro Thr Phe Phe 
Ala Arg Lys Phe Glu Ala Val Val Asn Gin Glu He He Gly Gin Leu 
Asp Tyr Tyr Leu Tyr Gly Asn Tyr Pro Ala Gly Thr Pro Gly Leu Arg 
Ser Tyr Trp Glu Asn Val Tyr Asp Glu Pro Asp Gly He His Ser Leu 
Ser Asp Val Thr Leu Thr Leu Tyr His Ser Phe Ala Arg Leu Gly Leu 
Arg Arg Ala Glu Thr Ser Leu His Thr Asp Gly Glu Asn Ser Cys Arg 
Tyr Tyr Pro Met Gly His Pro Ala Ser Val His Leu Tyr Phe Leu Ala 
Asp Arg Phe Gin Gly Phe Leu He Lys His His Ala Thr Asn Leu Ala 
Val Ser Lys Leu Glu Thr Leu Glu Thr Trp Val Met Pro Lys Lys Val 
Phe Lys Ile Ala Ser Pro Pro Ser Asp Phe Gly Arg Leu Gin Phe Ser 
Glu Val Gly Thr Asp Trp Asp Ala Lys Glu Arg Leu Phe Arg Asn Phe 
Gly Gly Leu Leu Gly Pro Met Asp Glu Pro Val Gly Met Gin Lys Trp 

Gly Lys Gly Pro Mfi Va * Thr Val Thr Val Ile Tr P Val As P Pro Val 
Asn Val Ile Ala Ala Thr Tyr Asp Ile Leu Ile Glu Ser Thr Ala Glu 
Phe Thr His Tyr Lys Pro Pro Leu Asn Leu Pro Leu Arg Pro Gly Val 
Trp Thr Val Lys Ile Leu His His Trp Val Pro Val Ala Glu Thr Lys 
Phe Leu Val Ala Pro Leu Thr Phe Ser Asn Arg Gin Pro Ile Lys Pro 
Glu Glu Ala Leu Lys Leu His Asn Gly Pro Leu Arg Asn Ala Tyr Met 
Glu Gin Ser Phe Gin Ser Leu Asn Pro Val Leu Ser Leu Pro Ile Asn 
Pro Ala Gin Val Glu Gin Ala Arg Arg Asn Ala Ala Ser Thr Gly Thr 
Ala Leu Glu Gly Trp Leu Asp Ser Leu Val Gly Gly Met Trp Thr Ala 
Met Asp He Cys Ala Thr Gly Pro Thr Ala Cys Pro Val Met Gin Thr 
Cys Ser Gin Thr Ala Trp Ser Ser Phe Ser Pro Asp Pro Lys Ser Glu 
Leu Gly Ala Val Lys Pro Asp Gly Arg Leu Arg 
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Fig.8A 

CGCAGGCCCCGCCCCCGGCCCCGCCCCGGCGCGCGCCCGGCCCGCCTCCCTAGGCGTGGAGGAGGGGGGG 

CGGCTCAGCCCCGCGCCCGTGCGCGCGCTCGCGGCCGGGTTGCAGGGCTGGGCGCGCGCCCCGCGTCCCG 

GGCAGGAAGATGGTGGCGAGCGCGCGAGTGCAGAAGCTGGTGCGGCGCTACAAGCTGGCGATTGCCACGG 

CGCTGGCCATCCTGCTGCTGCAGGGCCTGGTAGTGTGGAGCTTCAGCGGCCTGGAGGAGGACGAGGCGGG 

CGAGAAAGGAAGGCAGAGGAAGCCACGGCCACTGGACCCTGGCGAAGGTTCCAAGGACACAGACAGTTCA 

GCAGGGCGACGGGGCAGCACAGGCAGAAGGCATGGGCGCTGGCGGGGCCGTGCTGAGAGCCCAGGAGTGC 

CCGTGGCCAAGGTGGTACGGGCAGTAACCAGCCGGCAGAGAGCCAGCCGGCGGGTCCCACCTGCCCCACC 

TCCGGAAGCCCCAGGCCGCCAGAACCTGAGTGGGGCAGCAGCTGGGGAGGCGCTGGTAGGGGCAGCTGGC 

TTCCCACCACACGGAGATACAGGGAGCGTGGAGGGCGCCCCCCAGCCCACGGACAATGGCTTCACCCCCA 

AGTGCGAGATCGTGGGCAAGGACGCACTGTCTGCACTGGCCCGGGCCAGCACCAAGCAGTGCCAGCAGGA 

GATCGCCAATGTGGTGTGCCTGCACCAGGCTGGGAGCCTCATGCCCAAGGCTGTGCCCCGGCACTGTCAG 

CTGACTGGGAAGATGAGCCCCGGCATCCAGTGGGATGAGAGCCAAGCCCAGCAGCCCATGGATGGCCCCC 

CGGTGCGAATCGCCTACATGCTGGTGGTTCACGGCCGCGCCATCCGCCAGCTGAAGCGTCTCCTCAAGGC 

CGTTTATCACGAGCAGCACTTCTTTTACATCCATGTGGACAAGCGTTCCGACTACCTGCACCGGGAGGTG 

GTGGAGCTGGCCCAGGGCTATGATAACGTGCGGGTGACGCCCTGGCGCATGGTTACCATCTGGGGCGGGG 

CCAGCCTCCTGACGATGTACCTGCGGAGCATGCGGGACCTGCTAGAGGTGCCTGGCTGGGCCTGGGACTT 

CTTCATCAACCTCAGTGCCACTGACTATCCAACCAGGACCAATGAGGAGCTGGTGGCATTCCTATCCAAG 

AACCGGGACAAGAATTTCCTCAAGTCACATGGCCGGGACAACTCCAGGTTCATCAAGAAACAGGGCCTGG 

ACCGGCTCTTCCATGAGTGCGACTCACACATGTGGCGCCTGGGCGAGCGGCAGATCCCAGCAGGCATTGT 

GGTGGATGGCGGTTCTGACTGGTTCGTGCTGACACGCAGCTTTGTGGAGTATGTGGTGTACACAGATGAC 

CCGCTTGTGGCCCAGCTGCGCCAGTTCTACACATACACACTGCTCCCAGCCGAGTCCTTCTTCCACACGG 

TGCTGGAGAACAGCCTGGCCTGTGAGACCCTCGTGGACAACAACCTGCGGGTCACCAACTGGAACCGCAA 

GCTGGGCTGCAAGTGCCAGTACAAGCACATTGTGGACTGGTGTGGCTGCTCCCCCAACGACTTCAAGCCA 

CAGGACTTCCTCCGGCTGCAGCAAGTCTCCAGACCCACCTTCTTCGCCCGGAAGTTCGAGTCGACTGTGA 

ACCAGGAGGTGCTGGAAATCCTGGACTTCCACCTGTACGGCAGCTACCCCCCCGGCACGCCAGCCCTCAA 

GGCCTACTGGGAGAACACCTACGACGCGGCTGATGGCCCCAGTGGGCTCAGTGATGTCATGCTCACTGCT 

TACACAGCCTTCGCCCGCCTCAGCCTGCACCATGCCGCCACTGCTGCACCCCCAATGGGCACCCCACTCT 

GCAGGTTTGAGCCCAGGGGCTTGCCGTCCAGCGTGCACCTGTATTTCTATGACGACCATTTCCAGGGCTA 

CCTGGTGACGCAGGCGGTGCAGCCCTCAGCCCAGGGGCCGGCAGAGACGCTTGAGATGTGGCTGATGCCC 

CAAGGGTCGCTGAAGCTGTTGGGGCGCAGTGACCAGGCCAGCCGGCTCCAGAGTCTGGAGGTTGGCACTG 

ATTGGGACCCCAAAGAGCGTCTTTTCCGGAACTTTGGGGGGTTACTGGGGCCGCTGGACGAGCCTGTGGC 

CGTGCAGCGCTGGGCCCGGGGCCCCAACCTCACAGCCACAGTGGTCTGGATCGACCCAACCTATGTGGTG 

GCCACATCTTATGACATCACAGTAGATACGGAGACTGAGGTCACGCAATACAAGCCCCCACTGAGCCGGC 

CCCTGCGGCCAGGGCCCTGGACTGTTCGACTCCTTCAGTTCTGGGAACCGCTGGGTGAGACCCGCTTCCT 

TGTGCTGCCCTTGACCTTCAACCGCAAACTACCTCTCAGGAAAGATGATGCCAGCTGGCTGCACGCAGGG 

CCACCCCACAACGAGTACATGGAGCAGAGTTTCCAGGGCCTGAGTAGCATCCTGAACCTGCCTCAGCCGG 

AGCTCGCGGAGGAGGCTGCCCAGCGGCACACACAGCTCACAGGCCCTGCGCTCGAGGCCTGGACAGACAG 

GGAACTGAGCAGCTTCTGGTCCGTGGCTGGACTGTGTGCCATAGGCCCCTCTCCCTGCCCCTCCCTGGAG 

CCCTGCAGACTGACCAGCTGGAGCTCTCTGTCCCCCGACCCCAAATCAGAGCTGGGGCCTGTCAAAGCAG 

ACGGGCGACTCAGGTAGCAGGGCCCCAGCCAGTACCCGTGGAGGACCCGGGAAATTGCACCTTACAGACA 

GTGGAGGGGTGTCCCCTCCCACAGGCAAGAACCAGAGGCCCAGGCTGCACACCCATTTCAGCCATCAAGA 

ACCCACACAGAC GGCAGGGAAGGT GGACACAGT ATGAACTACT GC TGATGT CTCT GT TGGGGATCAGAGG 

GCTGGCGGGAACGCGAG7VAGGGCACCAGCAGCATTCCACACCCAGCTCTTCCTCACCTTCCTGTCTAGTT 

TGAATTTCTTTTTTTTCTTTCTTTTTTTTTTTTTTTTAATTTAAAAAGGAAAATGGGTGGTTGGGAGAGA 

AACTAGAACAGAAGAT GTGCAATAGGGC TCAGAGCAGCCCCAG GAAGTGGGCC AT GT C TGTGGGGAGCAG 

TGGCTTGCTGAGCCCACCTGCTATTGTTCTGCACGAGGCTGGGCCTGCCTCACTCTCCAGGGCCTCATGC 

CCCATTCTGGGCCTGTGGTGCTCGTGGCTGAGGCTCCACAGGGCTGCAAGTGCCCTGCCAGGCTCTAAGG 

CCCGAAGAACAGGTGATATCGGGGGCGCTGCAGAGCTGGGCTCTAGCAGGACAGTTCTTTTGTAGCCAGG 

GGACCCTAGAAGTGGGGTGGGGCGGCTCCAGGGCTTTCCAGCATACCCTGCCCCTGGATGGGAAAGGCAG 

GGTCAGGGCCCACTGAGACCCATGCAGAGTTTCCAGGTGTGCACTGGAAGTGAGGTCACATGAGCAGCGT 

GGGAAGAAGAC T C T GT CAAGAC TC TCAGAAG AACCTGGAGTAAT T GTGCCTGAAG C T CAGCGTGAAGT C T 

GAAATATGCAACAGAAGAAATATATCTCTATCTCTCTA 
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Fig.SB, 



Met Val Ala Ser Ala Arg Val Gin Lys Leu Val Arg Arg Tyr Lys Leu Ala 
He Ala Thr Ala Leu Ala lie Leu Leu Leu Gin Gly Leu Val Val Trp Ser 
Phe Ser Gly Leu Glu Glu Asp Glu Ala Gly Glu Lys Gly Arg Gin Arg Lys 
Pro Arg Pro Leu Asp Pro Gly Glu Gly Ser Lys Asp Thr Asp Ser Ser Ala 
Gly Arg Arg Gly Ser Thr Gly Arg Arg His Gly Arg Trp Arg Gly Arg Ala 
Glu Ser Pro Gly Val Pro Val Ala Lys Val Val Arg Ala Val Thr Ser Arg 
Gin Arg Ala Ser Arg Arg Val Pro Pro Ala Pro Pro Pro Glu Ala Pro Gly 
Arg Gin Em Leu Ser Gly Ala Ala Ala Gly Glu Ala Leu Val Gly Ala Ala 
Gly Phe Pro Pro His Gly Asp Thr Gly Ser Val Glu Gly Ala Pro Gin Pro 
Thr Asp Asn Gly Phe Thr Pro Lys Cys Glu He Val Gly Lys Asp Ala Leu 
Ser Ala Leu Ala Arg Ala Ser Thr Lys Gin Cys Gin Gin Glu He Ala Asn 
Val Val Cys Leu His Gin Ala Gly Ser Leu Met Pro Lys Ala Val Pro Arg 
His Cys Gin Leu Thr Gly Lys Met Ser Pro Gly He Gin Trp Asp Glu Ser 
Gin Ala Gin Gin Pro Met Asp Gly Pro Pro Val Arg He Ala Tyr Met Leu 
Val Val His Gly Arg Ala He Arg Gin Leu Lys Arg Leu Leu Lys Ala Val 
Tyr His Glu Gin His Phe Phe Tyr He His Val Asp Lys Arg Ser Asp Tyr 
Leu His Arg Glu Val Val Glu Leu Ala Gin Gly Tyr Asp Asn Val Arg Val 
Thr Pro Trp Arg Met Val Thr lie Trp Gly Gly Ala Ser Leu Leu Thr Met 
Tyr Leu Arg Ser Met Arg Asp Leu Leu Glu Val Pro Gly Trp Ala Trp Asp 
Phe Phe He Ma Leu Ser Ala Thr Asp Tyr Pro Thr Arg Thr Asn Glu Glu 
Leu Val Ala Phe Leu Ser Lys Asn Arg Asp Lys Asn Phe Leu Lys Ser His 
Gly Arg Asp Asn Ser Arg Phe He Lys Lys Gin Gly Leu Asp Arg Leu Phe 
His Glu Cys Asp Ser His Met Trp Arg Leu Gly Glu Arg Gin He Pro Ala 
Gly He Val Val Asp Gly Gly Ser Asp Trp Phe Val Leu Thr Arg Ser Phe 
Val Glu Tyr Val Val Tyr Thr Asp Asp Pro Leu Val Ala Gin Leu Arg Gin 
Phe Tyr Thr Tyr Thr Leu Leu Pro Ala Glu Ser Phe Phe His Thr Val Leu 
Glu Asn Ser Leu Ala Cys Glu Thr Leu Val Asp Asn Asn Leu Arg Val Thr 
Asn Trp Asn Arg Lys Leu Gly Cys Lys Cys Gin Tyr Lys His He Val Asp 
Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys Pro Gin Asp Phe Leu Arg Leu 
Gin Gin Val Ser Arg Pro Thr Phe Phe Ala Arg Lys Phe Glu Ser Thr Val 
Asn Gin Glu Val Leu Glu He Leu Asp Phe His Leu Tyr Gly Ser Tyr Pro 
Pro Gly Thr Pro Ala Leu Lys Ala Tyr Trp Glu Asn Thr Tyr Asp Ala Ala 
Asp Gly Pro Ser Gly Leu Ser Asp Val Met Leu Thr Ala Tyr Thr Ala Phe 
Ala Arg Leu Ser Leu His His Ala Ala Thr Ala Ala Pro Pro Met Gly Thr 
Pro Leu Cys Arg Phe Glu Pro Arg Gly Leu Pro Ser Ser Val His Leu Tyr 
Phe Tyr Asp Asp His Phe Gin Gly Tyr Leu Val Thr Gin Ala Val Gin Pro 
Ser Ala Gin Gly Pro Ala Glu Thr Leu Glu Met Trp Leu Met Pro Gin Gly 
Ser Leu Lys Leu Leu Gly Arg Ser Asp Gin Ala Ser Arg Leu Gin Ser Leu 
Glu Val Gly Thr Asp Trp Asp Pro Lys Glu Arg Leu Phe Arg Asn Phe Gly 
Gly Leu Leu Gly Pro Leu Asp Glu Pro Val Ala Val Gin Arg Trp Ala Arg 
Gly Pro Asn Leu Thr Ala Thr Val Val Trp He Asp Pro Thr Tyr Val Val 
Ala Thr Ser Tyr Asp He Thr Val Asp Thr Glu Thr Glu Val Thr Gin Tyr 
Lys Pro Pro Leu Ser Arg Pro Leu Arg Pro Gly Pro Trp Thr Val Arg Leu 
Leu Gin Phe Trp Glu Pro Leu Gly Glu Thr Arg Phe Leu Val Leu Pro Leu 
Thr Phe Asn Arg Lys Leu Pro Leu Arg Lys Asp Asp Ala Ser Trp Leu His 
Ala Gly Pro Pro His Asn Glu Tyr Met Glu Gin Ser Phe Gin Gly Leu Ser 
Ser He Leu Asn Leu Pro Gin Pro Glu Leu Ala Glu Glu Ala Ala Gin Arg 
His Thr Gin Leu Thr Gly Pro Ala Leu Glu Ala Trp Thr Asp Arg Glu Leu 
Ser Ser Phe Trp Ser Val Ala Gly Leu Cys Ala He Gly Pro Ser Pro Cys 
Pro Ser Leu Glu Pro Cys Arg Leu Thr Ser Trp Ser Ser Leu Ser Pro Asp 
Pro Lys Ser Glu Leu Gly Pro Val Lys Ala Asp Gly Arg Leu Arg 
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Fig. 9 



TACTTCTCTCACCGGCCCAAAGAGAAAGTTCGGACCGACAGCAACAATGAGAACTCAGTCCCCAA 
GGACTTTGAGAATGTGGACAACAGCAACTTCGCACCCAGGACTCAAAAGCAGAAGCACCAGCCAG 
AGTTGGCAAAGAAGCCCCTCAGCAGGCAGAAAGAGCGTTTGCAGAGAAAGCTGGGCGCCCAGGAC 
T^AAGGACAGGGGCAGTCAGTCCTAGGAAAAGGCCCCAAGGAGGTGCTGCCTCCTCGGGAAAAAGC 
CCCAGGCAACAGTAGCCAAGGGAAGGATCTCTCAAGACACAGCCATAGCAGGAAGAGTGGTGGCG 
GTGGGTCCCCGGAAACTAAGTCTGACCAGGTCCCCAAGTGTGACATCTCTGGCAAGGAGGCCATC 
TCAGCGCTGACCCGCGCTAAGTCCAAGCACTGTCGCCAGGAGATTGCAGAAACCTACTGTCGCCA 
CAAGCTGGGGCTGCTGATGCCAGAGAAGGTGGCTCGATTCTGTCCCCTGGAAGGCAAAGCCAACA 
AGAATGTCCAGTGGGATGAGGATGCTGTTGAGTACATGCCCCCCAACCCAGTCAGAATCGCCTTT 
GTCCTGGTGGTCCATGGCCGTGCCTCTCGACAGCTGCAGCGCATGTTTAAGGCCATCTACCACAA 
AGACCATTTCTACTATATCCATGTGGATAAGCGTTCCAATTACCTGCATCGGCAAGTGCTCCAGT 
TCTCCAGGCAGTACGACAACGTCCGAGTCACTTCCTGGAGGATGGCCACCATTTGGGGCGGAGCC 
AGCCTCCTGTCCACTTACCTGCAGAGCATGCGGGATCTACTGGAGATGACTGACTGGCCCTGGGA 
CTTCTTCATCAACCTGAGTGCGGCTGACTACCCCATCAGGACAAATGACCAGCTCGTAGCATTTC 
TTTCCAGATACCGAGATATGAATTTCCTGAAGTCACATGGTCGGGACAATGCAAGGTTCATCCGG 
AAGCAGGACCTGGACCGTCTCTTCCTGGAGTGTGATACACACATGTGGCGCCTGGGGGACCGGCG 
AATCCCAGAGGGCATTGCTGTAGATGGCGGCTCTGATTGGTTCCTGCTAAACCGGAAGTTTGTAG 
AGTATGTGGCATTCTCCACAGATGACCTGGTGACCAAGATGAAGCAGTTCTACTCTTATACCCTC 
CTCCCTGCTGAGTCCTTTTTCCACACGGTCCTGGAGAACAGCCCCCACTGTGACACCATGGTGGA 
TAACAACCTGCGCATCACCAATTGGAACCGCAAGCTGGGCTGCAAGTGCCAGTACAAGCATATTG 
TGGACTGGTGTGGCTGCTCTCCCAATGACTTCAAGCCTCAGGACTTCCATCGCTTCCAGCAGACG 
GCCCGGCCCACCTTCTTTGCCCGAAAGTTCGAAGCCATAGTGAACCAGGAAATCATTGGGCAGTT 
GGACTCTTACCTGTATGGCAACTATCCTGCGGGCACCCCGGGCCTCCGCTCTTACTGGGAGAATG 
TCTATGACGAACCAGATGGCATCCAGAGCCTCAGCGATGTGGCACTCACCATGTATCATTCCTTC 
ATCCGCCTGGGTCTTCGAAGGGCTGAGTCATCGCTACACACGGATGGGGAGAACAGCTGCAGGTA 
CTATCCAATGGGCCACCCAGCGTCTGTCCACCTCTACTTCCTGGCTGACCGATTCCAGGGCTTTC 
T GAT C AAG CACCAT GT GACC AACC T T GCTGT GAGCAAACT G GAGACACT GGAGACAT GGAT GAT G 
CCAAAGAAAGTCTTCAAGGTCGCAAGTCCCCCCAGTGACTTTGGGAGGCTTCAGTTTTCTGAGGT 
TGGCACTGACTGGGATGCCAAGGAGAGACTATTCCGGAACTTTGGTGGTCTTTTGGGGCCCATGG 
ATGAGCCGGTGGGAATGCAGAAATGGGGGAAGGGACCCAATGTGACCGTGACTGTTATTTGGGTG 
GATCCTGTCAATGTCATCGCAGCCACCTATGATATCCTGATCGAGTCCACTGCGGAATTCACACA 
CTACAAACCCCCTTTGAATCTGCCTCTGAGGCCTGGTGTCTGGACAGTGAAGATTCTCCATCACT 
GGGTACCAGTTGCAGAGACCAAATTCCTTGTCGCACCTCTGACCTTCTCCAACAAGCAGCCCATC 
AAACCAGAGGAGGCTTTGAAGCTGCACAATGGGCCACCTCGCAGTGCCTACATGGAGCAGAGTTT 
CCAGAGCCTAAACCCAGTCCTCAGCCTGCACATCAATCCTGCCCAAGTGGAGCAGGCCCGGAAGA 
ACGCAGCCTTCACTGGGACAGCACTAGAAGCCTGGCTGGTGGGTGGGACTTGGACTGCCATGGAC 
GTCTGTGCCACAGGCCCCACTGCCTGCCCAGTCATGCAAACCTGCAGCCAAACAGCCTGGAGTTC 
CTTCAGCCCTGATCCCAAGTCAGAGCTGGGTGCAGTGAAGCCTGATGGTCGGCTCAGGTAGCACT 
GGCCACATGGGACTGATACAGCCTGAACCCTGGCCTCTGATCTTATATGGGGGAGGGGTTCCTTC 
TGTTCAAGGACCTCTGTTGCCTGTAGGACAGTGGAAAGAGAGGATTTGGAGGCCAGA 
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Fig. 10 

GCGCTCGCGGCCGGGTTGCAGGGCCGGACGCGCGCCCCGCGTCCCGGGCAGGAAGATGGTGGCGA 

GCGCGCGGGTGCAGAAGCTGGTGCGGCGCTACAAGCTGGCGATCGCCACCGCGCTGGCCATCCTG 

CTGCTGCAGGGCCTGGTGGTGTGGAGCTTCAGCGGCCTGGAGGAGGACGAGCCGGGCGAGAAAGG 

AAGGCAGAGAAAGCCACGGCCGCTAGACCCTGGAGAGGGTTCCAAGGACACGGACAGCTCTGCAG 

GGCGTAGGGGCAGTGCTGGGAGAAGGCACGGACGGTGGCGGGGCCGCGCCGAGAGCCCAGGTGTG 

CCTGTGGCCAAGGTGGTGCGGGCAGTAACCAGTCGGCAGAGAGCCAGCCGACGGGTGCCCCCTGC 

CCCACCTCCAGAGGCACCAGGCCGCCAGAACCTGAGTGGAGCAGCAGCCGGGGAAGCACTGATAG 

GTGCCGCCGGCTTCCCACAACATGGAGACACAGGGAGTGTGGAGGGTGCCCCTCAGCCCACGGAT 

AACAGCTTCACTCCGAAGTGTGAGATTGTGGGCAAGGATGCACTGTCAGCACTGGCCCGGGCCAG 

CACCAAGCATTGTCAACAAGAGATTGCTAATGTAGTGTGCCTGCACCAAGCTGGGAACCTAATGC 

CCAAGTCTGTGCCCCGGCACTGCCAGCTGGCTGGCAAGGTGAGCCCTGGCATCCAGTGGGAAGAG 

GTCCGGGCCCAGCAGCCTGTGAGTGGCCCTCTGGTACGCATCGCCTACATGCTGGTGGTTCACGG 

CCGTGCTGTGCGCCAGCTGAAGCGTCTTCTGAAGGCCGTCTACCACGAGGAGCACTTCTTTTATA 

TTCATGTGGACAAGCGTTCCAACTACCTGTACCGGGAGGTGGTAGAGCTGGCCCAGCACTACGAC 

AATGTACGGGTAACACCTTGGCGCATGGTCACCATCTGGGGTGGGGCTAGCCTTCTGAGGATGTA 

CCTGCGGAGCATGAAGGACCTTCTGGAGACTCCTGGCTGGACCTGGGACTTCTTCATCAACCTGA 

GCGCTACTGACTATCCAACCAGGACGAATGAGGAGCTGGTAGCGTTCTTATCCAAGAACCGGGAC 

AAGAATTTCCTCAAGTCACACGGGCGAGACAATTCCAGGTTCATCAAGAAACAAGGCCTGGACCG 

GCTTTTCCATGAGTGTGATTCCCACATGTGGCGCCTGGGTGAACGGCAGATCCCGGCAGGCATCG 

TGGTGGATGGTGGCTCTGACTGGTTCGTGCTGACACGCAGCTTTGTGGAATATGTGGTGTATACA 

GAGGATCCTCTGGTGGCCCAGCTTCGCCAGTTCTATACATACACATTGCTTCCAGCCGAGTCCTT 

CTTCCACACAGTGCTGGAGAACAGCCCAGCCTGTGAGAGCCTAGTGGACAACAACCTGCGGGTTA 

CCAACTGGAACCGGAAGCTGGGCTGCAAGTGCCAGTACAAGCACATCGTGGACTGGTGTGGCTGC 

TCCCCCAACGACTTCAAGCCACAGGACTTCCTGCGGCTTCAGCAAGTCTCCAGACCCACCTTCTT 

TGCCCGGAAGTTTGAGTCGACTGTGAACCAGGAAGTCCTGGAAATTTTGGACTTCCACCTGTATG 

GCAGCTACCCACCCGGCACCCCAGCCCTCAAGGCCTACTGGGAGAACATCTACGACATGGCCGAT 

GGCCCTAGTGGACTCAGCGATGTCCTACTCACTGCTTACACAGCCTTTGCCCGTATCAGTCTGCG 

TCATGCTGCCACTGTTTCCCCACTGGCCACTGCAGTCTGCAGGTTTGAGCCCAGGGGGTTGCCGT 

CCAGCGTGCACCTGTATTTCTATGACGACCATTTCCAGGGCTACCTGGTGACGCAGGCAGTGCAG 

CCCTCAGCCCAGGGGCCAGCAGAGACACTTGAGATGTGGCTGATGCCCCAGAGGTTGCTGAAGCC 

GTTGGGGCACAGTGACCAGGCCAGCCGGCTCCAGAGTCTGGAGGTTGGCACTGAGTGGGACCCCA 

AAGAACGTCTCTTCCGGAACTTTGGGGGCCTGTTGGGACCACTGGATGAACCTGTGGCCATGCAG 

CGCTGGGCCCGGGGCCCCAACCTCACAGCCACTGTGGTCTGGATTGACCCCACCTATGTTGTGGC 

CACATCCTATGACATCACGGTAGATGCGGACACTGAAGTCACGCAGTACAAGCCCCCACTGAGCC 

TGCCACTGCGGCCAGGAGCCTGGACTGTTCGATTGCTTCAGTTCTGGGAGCCCCTGGGTGAGACC 

CGCTTCCTCGTGCTGCCATTGACCTTCAACCACAAACTACCTCTCAGGAAAGATGATGCCAGCTG 

GCTGCATGCGGGACCACCCCACAACGAATACATGGAACAGAGTTTCCAGGGACTAAGTGGCATCC 

TGAATCTGCCTCAGCCAGAGGCCGTGGAGGAGGCTGCCCGGCGGCACACAGAGCTCACAGGTCCT 

GCACTTGAGGCCTGGACAGATGGGGAACTGAGCAGTTTCTGGTCTGTTGCAGGATTGTGTGCCAT 

AGGGCCTTCTTCTTGTCCCTCCCTGGAGCTCTGCAGACTGACCAGCTGGAGCTCTCTGTCTCCTG 

ACCCCAAGTCAGAGCTGGGGCCTGTCAAAGCTGACGGACGACTCAGGTAGCAGGACCCCAGCCAG 

CACAACCCGGAGGAGCCGGGGAATTGCACCTTACAGACAATGGAGGGACGTCTCTCCTCTGGTAG 

CCAGGGAGCCCTAGAACAGTGGTGGGGCTGTCTGTCTGTCAGGCTTTCCTAGGAAATGGACGACA 

TCAGGCCCTGCTGAGATCCATGAAGTTTCCCCACTGGAGGAGGGAGCCCCACTGGCAGCACAG 
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Fig. 11 
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SEQUENCE LISTING 

<110> Kleesiek, Knut 

< 1 2 0 > Xylosyltransferase and isoforms thereof 

<130> Xylosyltransferase 

<140> 
<141> 

<160> 36 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 3726 
<212> DNA 

<213> Homo sapiens - 

<220> ' 

<221> CDS 

<222> (1) . . (2484) 

<223> human XT-I (not completely) 

<220> 

<221> misc__binding 
<222> (544) . . (552) 

<223> binding motif for nucleotide-sugars 
<220> 

<221> misc_feature 
<222> (280) . . (282) 
<223> N-glycosylation site 

<220> 

<221> misc_feature 
<222> (865) . . (867) 
<223> N-glycosylation site 

<220> 

<221> misc_feature 
<222> (1933) . . (1935) 
<223> N-glycosylation site 

<400> 1 

act cag gat ggc tac ttt tct cat egg ccg aaa gag aaa gtg cga aca 48 
Thr Gin Asp Gly Tyr Phe Ser His Arg Pro Lys Glu Lys Val Arg Thr 
15 10 15 

gac age aac aac gag aac tct gtc ccc aaa gac ttt gag aat gtg gac 96 
Asp Ser Asn Asn Glu Asn Ser Val Pro Lys Asp- Phe Glu Asn Val Asp 
20 25 30 

aac age aac ttc gca ccc agg act caa aag cag aag cac cag cct gag 14 4 
Asn Ser Asn Phe Ala Pro Arg Thr Gin Lys Gin Lys His Gin Pro Glu 
35 40 45 

ttg gcg aag aag cca ccg agt aga cag aag gag ctt ttg aaa agg aag 192 
Leu Ala Lys Lys Pro Pro Ser Arg Gin Lys Glu Leu Leu Lys Arg Lys 
50 55 60 
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ctg gaa cag cag gag aaa gga aaa gga cat aca ttc cct ggg aaa ggc 240 
Leu Glu Gin Gin Glu Lys Gly Lys Gly His Thr Phe Pro Gly Lys Gly 
65 70 75 80 

ccc ggt gag gtg ctg cct ccc ggg gac aga gcc gca gcc aac age age 288 
Pro Gly Glu Val Leu Pro Pro Gly Asp Arg Ala Ala Ala Asn Ser Ser 
85 90 95 

cac ggg aag gat gtg tec aga ccg cct cat gcc agg aaa act ggg ggc 336 
His Gly Lys Asp Val Ser Arg Pro Pro His Ala Arg Lys Thr Gly Gly 
100 105 110 

age tec ccc gag ace aag tat gac cag ccc cct aag tgt gac ate tea 384 
Ser Ser Pro Glu Thr Lys Tyr Asp Gin Pro Pro Lys Cys Asp lie Ser 
115 120 125 

ggc aag gag gcc ate tct gcc ctg tec cgt get aag tec aag cac tgc 432 
Gly Lys Glu Ala He Ser Ala Leu Ser Arg Ala .Lys Ser Lys His Cys 
130 135 140 

cgc cag gag att ggg gag act tac tgc cgc cac aag tta ggg ctg ctg 480 
Arg Gin Glu He Gly Glu Thr Tyr Cys Arg His Lys Leu Gly Leu Leu 
145 150 155 160 

atg cct gag aag gtg act egg ttc tgc ccc etc gag ggt aaa gcc aac 528 
Met Pro Glu Lys Val Thr Arg Phe Cys Pro Leu Glu Gly Lys Ala Asn 
165 170 175 

aag aac gtg cag tgg gac gag gac tec gtg gag tac atg cca gcc aac 576 
Lys Asn Val Gin Trp Asp Glu Asp Ser Val Glu Tyr Met Pro Ala Asn 
180 ^ 185 190 

ccg gtc aga ate gcc ttt gtc ctg gtg gtc cac ggc cgt gcc tct egg 624 
Pro Val Arg He Ala Phe Val Leu Val Val His Gly Arg Ala Ser Arg 
195 200 205 

cag ttg cag cgc atg ttc aag gcc ate tac cac aaa gac cac ttc tac 672 
Gin Leu Gin Arg Met Phe Lys Ala He Tyr His Lys Asp His Phe Tyr 
210 215 220 

tac ate cac gtg gac aag cgc tct aat tac ctg cat egg caa gtg etc 720 
Tyr lie His Val Asp Lys Arg Ser Asn Tyr Leu His Arg Gin Val Leu 
225 230 235 240 

cag gtc tec agg cag tac age aat gtc cgc gtc ace ccc tgg aga atg 7 68 
Gin Val Ser Arg Gin Tyr Ser Asn Val Arg Val Thr Pro Trp Arg Met 
245 250 255 



gcc ace ate tgg gga gga gcc age etc ctg tec ace tac ctg cag age 
Ala Thr He Trp Gly Gly Ala Ser Leu Leu Ser Thr Tyr Leu Gin Ser 
260 * ~ 265 270 



816 



atg egg gac etc ctg gag atg acc gac tgg ccc tgg gac ttc ttc ate 864 
Met Arg Asp Leu Leu Glu Met Thr Asp Trp Pro Trp Asp Phe Phe He 
275 280 285 

aac ctg agt gcg gcc gac tac ccc ate agg aca aat gac cag ttg gtg 912 
Asn Leu Ser Ala Ala Asp Tyr Pro He Arg Thr Asn Asp Gin Leu Val 
290 295 300 

gcg ttt etc tec cga tac cga gat atg aat ttc ttg aag tea cac ggc 960 
Ala Phe Leu Ser Arg Tyr Arg Asp Met Asn Phe Leu Lys Ser His Gly 
305 310 315 320 
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egg gac aat gca agg ttc att egg aag cag ggc ctg gat egg etc ttc 1008 
Arg Asp Asn Ala Arg Phe lie Arg Lys Gin Gly Leu Asp Arg Leu Phe 
325 330 335 

ctg gag tgc gac get cac atg tgg cgc ctg gga gat egg egg ate cca 1056 
Leu Glu Cys Asp Ala His Met Trp Arg Leu Gly Asp Arg Arg He Pro 
340 345 350 

gag ggc att gee gtg gat ggc ggt teg gac tgg ttc ctg ctg aac egg 1104 
Glu Gly lie Ala Val Asp Gly Gly Ser Asp Trp Phe Leu Leu Asn Arg 
355 360 365 

agg ttt gta gaa tat gtg acc ttc tec aca gac gat ctg gtg ace aag 1152 
Arg, Phe Val Glu Tyr Val Thr Phe Ser Thr Asp Asp Leu Val Thr Lys 
370 375 380 

atg aaa cag ttc tac tec tac acc ctg ctt cct get gag tec ttc ttc 1200 
Met Lys Gin Phe Tyr Ser Tyr Thr Lteu Leu Pro Ala Glu Ser Phe Phe 
385 * 390 395 400 

cat acg gtc ctg gag aac age ccc cac tgc gac acc atg gtg gac aac 1248 
His Thr Val Leu Glu Asn Ser Pro His Cys Asp Thr Met Val Asp Asn 
405 410 .415 

aac ctg cgc ate acc aac tgg aat cgc aag ctg ggc tgc aag tgc cag 1296 
Asn Leu Arg He Thr Asn Trp Asn Arg Lys Leu Gly Cys Lys Cys Gin 
420 425 430 

tac aag cac ate gtg gac tgg tgc ggc tgc tec ccc aat gac ttc aag 1344 
Tyr Lys His He Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys 
435 440 445 

ccg cag gac ttc cac cgc ttc cag cag aca gee egg cct acc ttc ttt 1392 
Pro Gin Asp Phe His Arg Phe Gin Gin Thr Ala Arg Pro Thr Phe Phe 
450 455 460 

gec cgc aag ttt gaa gee gtg gtg aat cag gaa ate att ggg cag ctg 14 40 
Ala Arg Lys Phe Glu Ala Val Val Asn Gin Glu He He Gly Gin Leu 
465 ~ 470 475 480 

gac tat tac ctg tac ggg aac tac cct gca ggt acc ccg ggc ctg cgc 1488 
Asp Tyr Tyr Leu Tyr Gly Asn Tyr Pro Ala Gly Thr Pro Gly Leu Arg 
485 490 495 

tec tac tgg gag aat gtc tac gat gag cct gac ggc ate cac age ctg 1536 
Ser Tyr Trp Glu Asn Val Tyr Asp Glu Pro Asp Gly lie His Ser Leu 
500 505 510 

age gac gtg aca etc acc ttg tac cac tec ttt gee cgc ctg ggt ctt 1584 
Ser Asp Val Thr Leu Thr Leu Tyr His Ser Phe Ala Arg Leu Gly Leu 
515 520 525 

cga egg get gag acg tec ctg cac acg gat ggg gag aac age tgc cga 1632 
Arg Arg Ala Glu Thr Ser Leu His Thr Asp Gly Glu Asn Ser Cys Arg 
530 535 540 

tac tac cca atg ggc cac cca gca tct gtg cac etc tac ttc ctt get 1680 
Tyr Tyr Pro Met Gly His Pro Ala Ser Val His Leu Tyr Phe Leu Ala 
545 550 555 560 

gac cgc ttc cag ggc ttt ctg ate aag cat cat get acc aat ctg get 1728 
Asp Arg Phe Gin Gly Phe Leu He Lys His His Ala Thr Asn Leu Ala 
565 570 575 
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gtg age aaa eta gag act ctg gag ace tgg gtg atg ccg aaa aaa gtc 177 6 

Val Ser Lys Leu Glu Thr Leu Glu Thr Trp Val Met Pro Lys Lys Val 
580 585 590 

ttc aag ate gca age cca ccc agt gac ttt ggg agg ctt cag ttt tec 1824 
Phe Lys lie Ala Ser Pro Pro Ser Asp Phe Gly Arg Leu Gin Phe Ser 
595 600 605 

gag gtc ggc act gac tgg gat gec aag gag agg eta ttc cgc aac ttt 1872 
Glu Val Gly Thr Asp Trp Asp Ala Lys Glu Arg Leu Phe Arg Asn Phe 
610 615 620 

999 99^ ctt ctg ggg ccc atg gat gag ccg gtg ggt atg cag aag tgg 1920 
Gly Gly Leu Leu Gly Pro Met Asp Glu Pro Val Gly Met Gin Lys Trp 
625 630 635 640 

ggg aag gga cct aat gtg acc gtg acc gtc att tgg gtg gat ccc gtc 1968 
Gly Lys Gly Pro Asn Val Thr Val Thr Val He .Trp Val Asp Pro Val 
645 650 655 

aat gtc ate gca gee acc tac gac ate etc att gag tec act gee gaa 2016 
Asn Val He Ala Ala Thr Tyr Asp He Leu He Glu Ser Thr Ala Glu 
660 665 670 

ttc aca cac tac aag ccc cct ttg aac ttg ccc ctg agg cct ggg gtc 2064 
Phe Thr His Tyr Lys Pro Pro Leu Asn Leu Pro Leu Arg Pro Gly Val 
675 " 680 685 

tgg aca gtg aaa att etc cac cac tgg gtg cca gtt gca gag acc aaa 2112 
Trp Thr Val Lys He Leu His His Trp Val Pro Val Ala Glu Thr Lys 
690 695 700 

ttc etc gtt gcg cct ctg acc ttc teg aac agg cag ccc ate aaa cct 2160 
Phe Leu Val Ala Pro Leu Thr Phe Ser Asn Arg Gin Pro He Lys Pro 
705 710 715 720 

gag gag gca ctg aag ctg cac aat ggg ccc etc cgc aat gee tac atg 2208 
Glu Glu Ala Leu Lys Leu His Asn Gly Pro Leu Arg Asn Ala Tyr Met 
725 730 735 

gag cag age ttc cag age eta aac ccc gtc etc age ctg ccc ate aac 2256 
Glu Gin Ser Phe Gin Ser Leu Asn Pro Val Leu Ser Leu Pro He Asn 
740 745 750 

ccc gee cag gtg gaa cag gca egg agg aac gca gee tec acg ggc aca 2304 
Pro Ala Gin Val Glu Gin Ala Arg Arg Asn Ala Ala Ser Thr Gly Thr 
755 760 765 

gcg ctg gag gga tgg ctg gac teg ttg gtg ggc ggg atg tgg act gee 2352 
Ala Leu Glu Gly Trp Leu Asp Ser Leu Val Gly Gly Met Trp Thr Ala 
770 775 780 

atg gac ate tgt gee acg ggc ccc aca gee tgc ccg gtc atg cag acc 2400 
Met Asp He Cys Ala Thr Gly Pro Thr Ala Cys Pro Val Met Gin Thr 
785 * 790 795 800 

tgc age cag acg gee tgg age tec ttc age cct gac ccc aag teg gag 24 48 
Cys Ser Gin Thr Ala Trp Ser Ser Phe Ser Pro Asp Pro Lys Ser Glu 
805 810 815 

ctg ggt gec gtc aaa cct gat ggc egg etc agg tag cactgggcac 24 94 

Leu Gly Ala Val Lys Pro Asp Gly Arg Leu Arg 
820 825 
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gaggagtggg ccacagcagg atctcaacgg 
accccggcct ccaccctggg ggaggccctc 
atggaaaggg aaggtcagca ggtcaaagca 
tgcgggtggg atggctcagt ccctgcactg 
aagtcctaca ggttccttgt cttccccttc 
attttcagac ttttctttcg agcagcagaa 
ggttcagggt gctgacgaaa tccaagctgc 
ctcccaaagc tcagccctct gggcggtccc 
tcagaaccca agggctctgc aaatgccagt 
ttcaaactcc agccctgccc tttggttcat 
gtgccagcct tgaaagtcat ggtcgtgggc 
agctctgagc ctctcagcag ccctccttgg 
gtgggaaagc cttttgatgt ggcatggcca 
tccaggggtc tgggcctcct tccctcaagg 
tgtttgtgag caccagactc aagaccatga 
gacagccatg accatgaaac tcacaaggca 
gagcaaaact cctgagctag acagcacagc 
ttgaggtcag acacacaaaa ccttcgtcaa 
accacatgaa ccagactttt cccgtccagg 
catgcccttc ggccacaggc tcaacgggac 
cccgcgggaa aaaaaaaaaa aaaaaaaaaa 



5/32 

gaaagcagcc agaggggttg tggggcctga 2554 

tgtgaatggg tctctcctgg ccatagaatg 2614 

ggatcagcca acaacctgcc tttggcaagc 2674 

tgactgtctc acctcttctg sttgatcctc 2734 

cagtgaccca cccctgaccc cagacgtgtg 2794 

cttcgtttac ggagcacagt cataagtgra 2854 

tctggttgaa gctgacaagt gcgaggttcc 2914 

cttgcccagg gtatctccta cggtacctct 2974 

ttgacaagca ctgcccagac caaccatggg 3034 

ttttctgctt ctcttggctg ggggactctg 3094 

cctttcccat ggaggctgca gccttaggag 3154 

gttgaactat tctccttagt aactaggtaa 3214 

aggtccagcc acaagtgcaa ctgccacctg 3274 

ctgccacaca aagtagcaga aataggatga 3334 

ccttctttga tccttgaaaa tgggaacttt 3394 

acgcggatga aactcacaaa gcaatgcttg 3454 

agcacccatc ccctgccaga gcccttccgt 3514 

^ttgcacaccg gtgctgttgg gagtgaccaa 3574 

aaatagcatt tcagatttgg tttttaattt 3634 

atgcaacata aaaatgggaa ggttattcaa 3694 
aa ~ 3726 



<210> 2 

<211> 827 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Thr Gin Asp Gly Tyr Phe Ser His Arg Pro Lys Glu Lys Val Arg Thr 
1 5 10 15 

Asp Ser Asn Asn Glu Asn Ser Val Pro Lys Asp Phe Glu Asn Val Asp 
20 25 30 

Asn Ser Asn Phe Ala Pro Arg Thr Gin Lys Gin Lys His Gin Pro Glu 
35 40 45 

Leu Ala Lys Lys Pro Pro Ser Arg Gin Lys Glu Leu Leu Lys Arg Lys 
50 55 60 

Leu Glu Gin Gin Glu Lys Gly Lys Gly His Thr Phe Pro Gly Lys Gly 
65 70 75 80 

Pro Gly Glu Val Leu Pro Pro Gly Asp Arg Ala Ala Ala Asn Ser Ser 
85 90 95 

His Gly Lys Asp Val Ser Arg Pro Pro His Ala Arg Lys Thr Gly Gly 
100 105 110 

Ser Ser Pro Glu Thr Lys Tyr Asp Gin Pro Pro Lys Cys Asp lie Ser 
115 " 120 ~ 125 

Gly Lys Glu Ala lie Ser Ala Leu Ser Arg Ala Lys Ser Lys His Cys 
130 135 140 

Arg Gin Glu lie Gly Glu Thr Tyr Cys Arg His Lys Leu Gly Leu Leu 
145 150 155 160 

Met Pro Glu Lys Val Thr Arg Phe Cys Pro Leu Glu Gly Lys Ala Asn 
165 170 175 
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Lys Asn Val Gin Trp Asp Glu Asp Ser Val Glu Tyr Met Pro Ala Asn 
180 185 190 

Pro Val Arg lie Ala Phe Val Leu Val Val His Gly Arg Ala Ser Arg 
195 200 205 

Gin Leu Gin Arg Met Phe Lys Ala lie Tyr His Lys Asp His Phe Tyr 
210 215 220 

Tyr lie His Val Asp Lys Arg Ser Asn Tyr Leu His Arg Gin Val Leu 
225 230 235 240 

Gin Val Ser Arg Gin Tyr Ser Asn Val Arg Val Thr Pro Trp Arg Met 
245 250 255 

Ala Thr lie Trp Gly Gly Ala Ser Leu Leu Ser Thr Tyr Leu Gin Ser 
260 " 265 270 

Met Axg Asp Leu Leu Glu Met Thr Asp Trp Pro Trp Asp Phe Phe lie 
275 280 285 

Asn Leu Ser Ala Ala Asp Tyr Pro lie Arg Thr Asn Asp Gin Leu Val 
290 295 300 

Ala Phe Leu Ser Arg Tyr Arg Asp Met Asn Phe Leu Lys Ser His Gly 
305 310 315 320 

Arg Asp Asn Ala Arg Phe He Arg Lys Gin Gly Leu Asp Arg Leu Phe 
325 330 335 

Leu Glu Cys Asp Ala His Met Trp Arg Leu Gly Asp Arg Arg He Pro 
340 345 350 

Glu Gly He Ala Val Asp Gly Gly Ser Asp Trp Phe Leu Leu Asn Arg 
355 360 365 

Arg Phe Val Glu Tyr Val Thr Phe Ser Thr Asp Asp Leu Val Thr Lys 
370 375 380 

Met Lys Gin Phe Tyr Ser Tyr Thr Leu Leu Pro Ala Glu Ser Phe Phe 
385 390 395 400 

His Thr Val Leu Glu Asn Ser Pro His Cys Asp Thr Met Val Asp Asn 
405 410 415 

Asn Leu Arg He Thr Asn Trp Asn Arg Lys Leu Gly Cys Lys Cys Gin 
420 425 430 

Tyr Lys His He Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys 
435 440 445 

Pro Gin Asp Phe His Arg Phe Gin Gin Thr Ala Arg Pro Thr Phe Phe 
450 455 460 

Ala Arg Lys Phe Glu Ala Val Val Asn Gin Glu He He Gly Gin Leu 
465 " 470 475 480 

Asp Tyr Tyr Leu Tyr Gly Asn Tyr Pro Ala Gly Thr Pro Gly Leu Arg 
485 " 490 495 



Ser Tyr Trp Glu Asn Val Tyr Asp Glu Pro Asp Gly He His Ser Leu 
500 505 510 
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Ser Asp Val Thr Leu Thr Leu Tyr His Ser Phe Ala Arg Leu Gly Leu 
515 520 525 

Arg Arg Ala Glu Thr Ser Leu His Thr Asp Gly Glu Asn Ser Cys Arg 
530 535 540 

Tyr Tyr Pro Met Gly His Pro Ala Ser Val His Leu Tyr Phe Leu Ala 
545 550 555 560 

Asp Arg Phe Gin Gly Phe Leu lie Lys His His Ala Thr Asn Leu Ala 
565 570 575 

Val Ser Lys Leu Glu Thr Leu Glu Thr Trp Val Met Pro Lys Lys Val 
580 585 590 

Phe Lys lie Ala Ser Pro Pro Ser Asp Phe Gly Arg Leu Gin Phe Ser 
595 600 605 

4 

Glu Val Gly Thr Asp Trp Asp Ala Lys Glu Arg Leu Phe Arg Asn Phe 
610 615 620 

Gly Gly Leu Leu Gly Pro Met Asp Glu Pro Val Gly Met Gin Lys Trp 
625 630 635 640 

Gly Lys Gly Pro Asn Val Thr Val Thr Val He Trp Val Asp Pro Val 
645 650 655 

Asn Val He Ala Ala Thr Tyr Asp He Leu He Glu Ser Thr Ala Glu 
660 665 670 

Phe Thr His Tyr Lys Pro Pro Leu Asn Leu Pro Leu Arg Pro Gly Val 
675 680 685 

Trp Thr Val Lys lie Leu His His Trp Val Pro Val Ala Glu Thr Lys 
690 695 700 

Phe Leu Val Ala Pro Leu Thr Phe Ser Asn Arg Gin Pro He Lys Pro 
705 710 715 720 

Glu Glu Ala Leu Lys Leu His Asn Gly Pro Leu Arg Asn Ala Tyr Met 
725 730 735 

Glu Gin Ser Phe Gin Ser Leu Asn Pro Val Leu Ser Leu Pro He Asn 
740 745 750 

Pro Ala Gin Val Glu Gin Ala Arg Arg Asn Ala Ala Ser Thr Gly Thr 
755 760 765 

Ala Leu Glu Gly Trp Leu Asp Ser Leu Val Gly Gly Met Trp Thr Ala 
770 775 780 

Met Asp He Cys Ala Thr Gly Pro Thr Ala Cys Pro Val Met Gin Thr 
785 790 795 800 

Cys Ser Gin Thr Ala Trp Ser Ser Phe Ser Pro Asp Pro Lys Ser Glu 
805 810 * 815 



Leu Gly Ala Val Lys Pro Asp Gly Arg Leu Arg 
820 825 



WO 01/49831 



PCT/EP00/13311 



8/32 

<210> 3 

<211> 3608 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (150) . . (2744) 
<223> human XT-II 

<220> 

<221> misc_£eature 
<222> (513) . . (515) 
<223> N-glycosylation site 

<220> 

<221> misc_feature 
<222> (1128) . . (1130) 
<223> N-glcosylation site 

<220> 

<221> misc_feature 
<222> (2196) . . (2198) 
<223> N-glycosylation site 

<400> 3 

cgcaggcccc gcccccggcc ccgccccggc gcgcgcccgg cccgcctccc taggcgtgga 60 

ggaggggggg cggctcagcc ccgcgcccgt gcgcgcgctc gcggccgggt tgcagggctg 120 

ggcgcgcgcc ccgcgtcccg ggcaggaag atg gtg gcg age gcg cga gtg cag 173 

Met Val Ala Ser Ala Arg Val Gin 
1 5 



aag ctg gtg egg cgc tac aag ctg gcg att gec acg gcg ctg gec ate 
Lys Leu Val Arg Arg Tyr Lys Leu Ala lie Ala Thr Ala Leu Ala lie 
10 15 20 



221 



ctg ctg ctg cag ggc ctg gta gtg tgg age ttc age ggc ctg gag gag 
Leu Leu Leu Gin Gly Leu Val Val Trp Ser Phe Ser Gly Leu Glu Glu 
25 30 35 40 



269 



gac gag gcg ggc gag aaa gga agg cag agg aag cca egg cca ctg gac 
Asp Glu Ala Gly Glu Lys Gly Arg Gin Arg Lys Pro Arg Pro Leu Asp 
45 50 55 



317 



cct ggc gaa ggt tec aag gac aca gac agt tea gca ggg cga egg ggc 
Pro Gly Glu Gly Ser Lys Asp Thr Asp Ser Ser Ala Gly Arg Arg Gly 
60 65 70 



365 



age aca ggc aga agg cat ggg cgc tgg egg ggc cgt get gag age cca 
Ser Thr Gly Arg Arg His Gly Arg Trp Arg Gly Arg Ala Glu Ser Pro 
75 80 ~ 85 



413 



gga gtg ccc gtg gee aag gtg gta egg gca gta ace age egg cag aga 
Gly Val Pro Val Ala Lys Val Val Arg Ala Val Thr Ser Arg Gin Arg 
90 95 100 



4 61 



gec age egg egg gtc cca cct gee cca cct ccg gaa gee cca ggc cgc 
Ala Ser Arg Arg Val Pro Pro Ala Pro Pro Pro Glu Ala Pro Gly Arg 
105 HO 115 120 



509 



cag aac ctg agt ggg gca gca get ggg gag gcg ctg gta ggg gca get 
Gin Asn Leu Ser Gly Ala Ala Ala Gly Glu Ala Leu Val Gly Ala Ala 
125 130 135 



557 



r 
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ggc ttc cca cca cac gga gat aca ggg age gtg gag ggc gec ccc cag 605 
Gly Phe Pro Pro His Gly Asp Thr Gly Ser Val Glu Gly Ala Pro Gin 
140 145 150 

ccc acg gac aat ggc ttc acc ccc aag tgc gag ate gtg ggc aag gac 653 
Pro Thr Asp Asn Gly Phe Thr Pro Lys Cys Glu lie Val Gly Lys Asp 
155 160 165 

gca ctg tct gca ctg gec egg gec age acc aag cag tgc cag cag gag 701 
Ala Leu Ser Ala Leu Ala Arg Ala Ser Thr Lys Gin Cys Gin Gin Glu 
170 175 180 

ate gec aat gtg gtg tgc ctg cac cag get ggg age etc atg ccc aag 74 9 
He Ala Asn Val Val Cys Leu His Gin Ala Gly Ser Leu Met Pro Lys 
185 190 195 200 

get gtg ccc egg cac tgt cag ctg act ggg aag atg age ccc ggc ate 797 
Ala Val Pro Arg His Cys Gin Leu Thr Gly Lys -Met Ser Pro Gly He 
205 210 215 

cag tgg gat gag age caa gec cag cag ccc atg gat ggc ccc ccg gtg 845 
Gin Trp Asp Glu Ser Gin Ala Gin Gin Pro Met Asp Gly Pro Pro Val 
220 225 230 

cga ate gee tac atg ctg gtg gtt cac ggc cgc gee ate cgc cag ctg 893 
Arg He Ala Tyr Met Leu Val Val His, Gly Arg Ala He Arg Gin Leu 
235 240 245 

aag cgt etc etc aag gec gtt tat cac gag cag cac ttc ttt tac ate 941 
Lys Arg Leu Leu Lys Ala Val Tyr His Glu Gin His Phe Phe Tyr He 
250 255 - 260 

cat gtg gac aag cgt tec gac tac ctg cac egg gag gtg gtg gag ctg 989 
His Val Asp Lys Arg Ser Asp Tyr Leu His Arg Glu Val Val Glu Leu 
265 270 275 280 

gec cag ggc tat gat aac gtg egg gtg acg ccc tgg cgc atg gtt acc 1037 
Ala Gin Gly Tyr Asp Asn Val Arg Val Thr Pro Trp Arg Met Val Thr 
285 290 295 

ate tgg ggc ggg gee age etc ctg acg atg tac ctg egg age atg egg 1085 
He Trp Gly Gly Ala Ser Leu Leu Thr Met Tyr Leu Arg Ser Met Arg 
300 305 310 

gac ctg eta gag gtg cct ggc tgg gee tgg gac ttc ttc ate aac etc 1133 
Asp Leu Leu Glu Val Pro Gly Trp Ala Trp Asp Phe Phe He Asn Leu 
315 320 325 

agt gee act gac tat cca acc agg acc aat gag gag ctg gtg gca ttc 1181 
Ser Ala Thr Asp Tyr Pro Thr Arg Thr Asn Glu Glu Leu Val Ala Phe 
330 335 340 

eta tec aag aac egg gac aag aat ttc etc aag tea cat ggc egg gac 1229 
Leu Ser Lys Asn Arg Asp Lys Asn Phe Leu Lys Ser His Gly. Arg Asp 
345 350 355 360 

aac tec agg ttc ate aag aaa cag ggc ctg gac egg etc ttc cat gag 1277 
Asn Ser Arg Phe He Lys Lys Gin Gly Leu Asp Arg Leu Phe His Glu 
365 370 375 

tgc gac tea cac atg tgg cgc ctg ggc gag egg cag ate cca gca ggc 1325 
Cys Asp Ser His Met Trp Arg Leu Gly Glu Arg Gin He Pro Ala Gly 
380 385 390 
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att gtg gtg gat ggc ggt tct gac tgg ttc gtg ctg aca cgc age ttt 1373 

lie Val Val Asp Gly Gly Ser Asp Trp Phe Val Leu Thr Arg Ser Phe 
395 400 405 

gtg gag tat gtg gtg tac aca gat gac ccg ctt gtg gec cag ctg cgc 1421 
Val Glu Tyr Val Val Tyr Thr Asp Asp Pro Leu Val Ala Gin Leu Arg 
410 415 420 

cag ttc tac aca tac aca ctg etc cca gec gag tec ttc ttc cac acg 14 69 
Gin Phe Tyr Thr Tyr Thr Leu Leu Pro Ala Glu Ser Phe Phe His Thr 
425 430 435 440 

gtg ctg gag aac age ctg gee tgt gag ace etc gtg gac aac aac ctg 1517 
Val Leu Glu Asn Ser Leu Ala Cys Glu Thr Leu Val Asp Asn Asn Leu 
445 450 455 

egg gtc acc aac tgg aac cgc aag ctg ggc tgc aag tgc cag tac aag 1565 
Arg Val Thr Asn Trp Asn Arg Lys L ! eu Gly Cys -Lys Cys Gin Tyr Lys 
460 465 470 

cac att gtg gac tgg tgt ggc tgc tec ccc aac gac ttc aag cca cag 1613 
His lie Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys Pro Gin 
475 480 485 



gac ttc etc egg ctg cag caa gtc tec aga ccc acc ttc ttc gec egg 
Asp Phe Leu Arg Leu Gin Gin Val Ser Arg Pro Thr Phe Phe Ala Arg 
490 495 500 



tgg gag aac acc tac gac gcg get gat ggc ccc agt ggg etc agt gat 
Trp Glu Asn Thr Tyr Asp Ala Ala Asp Gly Pro Ser Gly Leu Ser Asp 
540 545 550 

gtc atg etc act get tac aca gee ttc gee cgc etc age ctg cac cat 
Val Met Leu Thr Ala Tyr Thr Ala Phe Ala Arg Leu Ser Leu His His 
555 560 565 

gee gec act get gca ccc cca atg ggc acc cca etc tgc agg ttt gag 
Ala Ala Thr Ala Ala Pro Pro Met Gly Thr Pro Leu Cys Arg Phe Glu 
570 575 580 



ccg gca gag acg ctt gag atg tgg ctg atg ccc caa ggg teg ctg aag 
Pro Ala Glu Thr Leu Glu Met Trp Leu Met Pro Gin Gly Ser Leu Lys 
620 625 630 



1661 



aag ttc gag. teg act gtg aac cag gag gtg ctg gaa ate ctg gac ttc 1709 
Lys Phe Glu Ser Thr Val Asn Gin Glu Val Leu Glu lie Leu Asp Phe 
505 510 515 520 

cac ctg tac ggc age tac ccc ccc ggc acg cca gee etc aag gec tac 1757 
His Leu Tyr Gly Ser Tyr Pro Pro Gly Thr Pro Ala Leu Lys Ala Tyr 
525 530 535 



1805 



1853 



1901 



ccc agg ggc ttg ccg tec age gtg cac ctg tat ttc tat gac gac cat 194 9 
Pro Arg Gly Leu Pro Ser Ser Val His Leu Tyr Phe Tyr Asp Asp His 
585 ~ " 590 595 600 

ttc cag ggc tac ctg gtg acg cag gcg gtg cag ccc tea gee cag ggg 1997 
Phe Gin Gly Tyr Leu Val Thr Gin Ala Val Gin Pro Ser Ala Gin Gly 
605 610 615 



2045 



ctg ttg ggg cgc agt gac cag gee age egg etc cag agt ctg gag gtt 2093 
Leu Leu Gly Arg Ser Asp Gin Ala Ser Arg Leu Gin Ser Leu Glu Val 
635 " 640 645 
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ggc act gat tgg gac ccc aaa gag cgt ctt ttc egg aac ttt ggg ggg 
Gly Thr Asp Trp Asp Pro Lys Glu Arg Leu Phe Arg Asn Phe Gly Gly 
650 655 660 



2141 



tta ctg ggg ccg ctg gac gag cct gtg gec gtg cag cgc tgg gec egg 
Leu Leu Gly Pro Leu Asp Glu Pro Val Ala Val Gin Arg Trp Ala Arg 
665 ~* 670 675 680 



2189 



ggc ccc aac etc aca gee aca gtg gtc tgg ate gac cca acc tat gtg 
Gly Pro Asn Leu Thr Ala Thr Val Val Trp He Asp Pro Thr Tyr Val 
685 690 695 



2237 



gtg gec aca tct tat gac ate aca gta gat acg gag act gag gtc acg 
Val Ala Thr Ser Tyr Asp He Thr Val Asp Thr Glu Thr Glu Val Thr 
700 705 710 



2285 



caa tac aag ccc cca ctg age egg ccc ctg egg cca ggg ccc tgg act 
Gin Tyr Lys Pro Pro Leu Ser Arg Pro Leu Arg -Pro Gly Pro Trp Thr 
715 720 725 



2333 



gtt cga etc ctt cag ttc tgg gaa ccg ctg ggt gag acc cgc ttc ctt 
Val Arg Leu Leu Gin Phe Trp Glu Pro Leu Gly Glu Thr Arg Phe Leu 
730 735 740 



2381 



gtg ctg ccc ttg acc ttc aac cgc aaa eta cct etc agg aaa gat gat 
Val Leu Pro Leu Thr Phe Asn Arg Lys Leu Pro Leu Arg Lys Asp Asp 
745 750 ~ 755 760 



2429 



gec age tgg ctg cac gca ggg cca ccc cac aac gag tac atg gag cag 
Ala Ser Trp Leu His Ala Gly Pro Pro His Asn Glu Tyr Met Glu Gin 
765 770 775 



2477 



agt ttc cag ggc ctg agt age ate ctg aac ctg cct cag ccg gag etc 
Ser Phe Gin Gly Leu Ser Ser He Leu Asn Leu Pro Gin Pro Glu Leu 
780 785 790 



2525 



gcg gag gag get gec cag egg cac aca cag etc aca ggc cct gcg etc 2573 

Ala Glu Glu Ala Ala Gin Arg His Thr Gin Leu Thr Gly Pro Ala Leu 
795 800 805 

gag gec tgg aca gac agg gaa ctg age age ttc tgg tec gtg get gga 2621 

Glu Ala Trp Thr Asp Arg Glu Leu Ser Ser Phe Trp Ser Val Ala Gly 
810 815 820 

ctg tgt gec ata ggc ccc tct ccc tgc ccc tec ctg gag ccc tgc aga 2669 

Leu Cys Ala He Gly Pro Ser Pro Cys Pro Ser Leu Glu Pro Cys Arg 
825 830 835 840 

ctg acc age tgg age tct ctg tec ccc gac ccc aaa tea gag ctg ggg 2717 

Leu Thr Ser Trp Ser Ser Leu Ser Pro Asp Pro Lys Ser Glu Leu Gly 
845 850 ' 855 



cct gtc aaa gca gac ggg cga etc agg tagcagggee ccagccagta 
Pro Val Lys Ala Asp Gly Arg Leu Arg 
860 * 865 



2764 



cccgtggagg 
gcaagaacca 
cagggaaggt 
gegggaaege 
ctagtttgaa 
gggtggttgg 
aagtgggcca 



accegggaaa 
gaggeccagg 
ggacacagta 
gagaagggca 
tttctttttt 
gagagaaact 
tgtctgtggg 



ttgeacctta 
ctgcacaccc 
tgaactactg 
ccagcagcat 
ttctttcttt 
agaacagaag 
gagcagtggc 



cagacagtgg 
atttcageca 
ctgatgtctc 
tccacaccca 
tttttttttt 
atgtgcaata 
ttgetgagee 



aggggtgtcc 
tcaagaaccc 
tgttggggat 
gctcttcctc 
tttaatttaa 
gggctcagag 
cacctgctat 



cctcccacag 2824 
acacagaegg 2884 
cagagggctg 2944 
accttcctgt 3004 
aaaggaaaat 3064 
cagccccagg 3124 
tgttctgcac 3184 
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gaggctgggc ctgcctcact ctccagggcc tcatgcccca ttctgggcct gtggtgctcg 324 4 
tggctgaggc tccacagggc tgcaagtgcc ctgccaggct ctaaggcccg aagaacaggt 3304 
gatatcgggg gcgctgcaga gctgggctct agcaggacag ttcttttgta gccaggggac 3364 
cctagaagtg gggtggggcg gctccagggc tttccagcat accctgcccc tggatgggaa 3424 
aggcagggtc agggcccact gagacccatg cagagtttcc aggtgtgcac tggaagtgag 3484 
gtcacatgag cagcgtggga agaagactct gtcaagactc tcagaagaac ctggagtaat 354 4 
tgtgcctgaa gctcagcgtg aagtctgaaa tatgcaacag aagaaatata tctctatctc 3604 
tcta 3608 



<210> 4 
<211> 865 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Val Ala Ser Ala Arg Val Gin Lys Leu Val Arg Arg Tyr Lys Leu 
1 5 ' 10 15 

Ala lie Ala Thr Ala Leu Ala lie Leu Leu Leu Gin Gly Leu Val Val 
20 25 30 

Trp Ser Phe Ser Gly Leu Glu Glu Asp Glu Ala Gly Glu Lys Gly Arg 
35 "* 40 45 

Gin Arg Lys Pro Arg Pro Leu Asp Pro Gly Glu Gly Ser Lys Asp Thr 
50 55 60 

Asp Ser Ser Ala Gly Arg Arg Gly Ser Thr Gly Arg Arg His Gly Arg 
65 70 75 80 

Trp Arg Gly Arg Ala Glu Ser Pro Gly Val Pro Val Ala Lys Val Val 
85 90 95 

Arg Ala Val Thr Ser Arg Gin Arg Ala Ser Arg Arg Val Pro Pro Ala 
100 105 110 

Pro Pro Pro Glu Ala Pro Gly Arg Gin Asn Leu Ser Gly Ala Ala Ala 
115 120 125 

Gly Glu Ala Leu Val Gly Ala Ala Gly Phe Pro Pro His Gly Asp Thr 
130 135 140 

Gly Ser Val Glu Gly Ala Pro Gin Pro Thr Asp Asn Gly Phe Thr Pro 
145 150 . 155 160 

Lys Cys Glu lie Val Gly Lys Asp Ala Leu Ser Ala Leu Ala Arg Ala 
165 170 175 

Ser Thr Lys Gin Cys Gin Gin Glu lie Ala Asn Val Val Cys Leu His 
180 185 190 

Gin Ala Gly Ser Leu Met Pro Lys Ala Val Pro Arg His Cys Gin Leu 
195 200 205 

Thr Gly Lys Met Ser Pro Gly He Gin Trp Asp Glu Ser Gin Ala Gin 
210 215 220 

Gin Pro Met Asp Gly Pro Pro Val Arg He Ala Tyr Met Leu Val Val 
225 230 235 240 

His Gly Arg Ala He Arg Gin Leu Lys Arg Leu Leu Lys Ala Val Tyr 
245 250 255 
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His Glu Gin His Phe Phe Tyr lie His Val Asp Lys 
260 265 



Arg Ser Asp Tyr 
270 



Leu His Arg Glu Val Val Glu Leu Ala Gin Gly Tyr 
275 280 



Asp Asn Val Arg 
285 



Val Thr Pro Trp Arg Met Val Thr He Trp Gly Gly Ala Ser Leu Leu 
290 295 ~ " 300 



Thr Met Tyr Leu Arg Ser Met Arg Asp Leu Leu Glu 
305 ~ 310 315 



Val Pro Gly Trp 
320 



Ala Trp Asp Phe Phe He Asn Leu Ser Ala Thr Asp 
325 330 



Tyr Pro Thr Arg 
335 



Thr Asn Glu Glu Leu Val Ala Phe Leu Ser Lys Asn 
340 345 

Phe Leu Lys Ser His Gly Arg Asp Asn Ser Arg Phe 
355 360 



Arg Asp Lys Asn 
350 

He Lys Lys Gin 
365 



Gly Leu Asp Arg Leu Phe His Glu Cys Asp Ser His Met Trp Arg Leu 
370 375 380 



Gly Glu Arg Gin He Pro Ala Gly He Val Val Asp 
385 390 395 



Gly Gly Ser Asp 
400 



Trp Phe Val Leu Thr Arg Ser Phe Val Glu Tyr Val 
405 410 



Val Tyr Thr Asp 
415 



Asp Pro Leu Val Ala Gin Leu Arg Gin Phe Tyr Thr 
420 425 



Tyr Thr Leu Leu 
430 



Pro Ala Glu Ser Phe Phe His Thr Val Leu Glu Asn 
435 440 



Ser Leu Ala Cys 
445 



Glu Thr Leu Val Asp Asn Asn Leu Arg Val Thr Asn Trp Asn Arg Lys 
450 455 460 



Leu Gly Cys Lys Cys Gin Tyr Lys His He Val Asp 
465 470 475 



Trp Cys Gly Cys 
480 



Ser Pro Asn Asp Phe Lys Pro Gin Asp Phe Leu Arg 
485 490 



Leu Gin Gin Val 
495 



Ser Arg Pro Thr Phe Phe Ala Arg Lys Phe Glu Ser 
500 505 



Thr Val Asn Gin 
510 



Glu Val Leu Glu He Leu Asp Phe His Leu Tyr Gly 
515 520 



Ser Tyr Pro Pro 
525 



Gly Thr Pro Ala Leu Lys Ala Tyr Trp Glu Asn Thr Tyr Asp Ala Ala 
530 535 540 



Asp Gly Pro Ser Gly Leu Ser Asp Val Met Leu Thr 
545 550 555 



Ala Tyr Thr Ala 
560 



Phe Ala Arg Leu Ser Leu His His Ala Ala Thr Ala 
565 570 



Ala Pro Pro Met 
575 



Gly Thr Pro Leu Cys Arg Phe Glu Pro Arg Gly Leu 
580 585 



Pro Ser Ser Val 
590 
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His Leu Tyr Phe Tyr Asp Asp His Phe Gin Gly Tyr Leu Val Thr Gin 
595 600 ^ 605 

Ala Val Gin Pro Ser Ala Gin Gly Pro Ala Glu Thr Leu Glu Met Trp 
610 615 620 

Leu Met Pro Gin Gly Ser Leu Lys Leu Leu Gly Arg Ser Asp Gin Ala 
625 630 635 . 640 

Ser Arg Leu Gin Ser Leu Glu Val Gly Thr Asp Trp Asp Pro Lys Glu 
645 650 655 

Arg Leu Phe Arg Asn Phe Gly Gly Leu Leu Gly Pro Leu Asp Glu Pro 
660 * 665 670 

Val Ala Val Gin Arg Trp Ala Arg Gly Pro Asn Leu Thr Ala Thr Val 
675 680 685 

Val Trp lie Asp Pro Thr Tyr Val Val Ala Thr Ser Tyr Asp lie Thr 
690 695 700 

Val Asp Thr Glu Thr Glu Val Thr Gin Tyr Lys Pro Pro Leu Ser Arg 
705 710 715 720 

Pro Leu Arg Pro Gly Pro Trp Thr Val Arg Leu Leu Gin Phe Trp Glu 
725 730 735 

Pro Leu Gly Glu Thr Arg Phe Leu Val Leu Pro Leu Thr Phe Asn Arg 
740 745 750 

Lys Leu Pro Leu Arg Lys Asp Asp Ala Ser Trp Leu His Ala Gly Pro 
755 760 765 

Pro His Asn Glu Tyr Met Glu Gin Ser Phe Gin Gly Leu Ser Ser lie 
770 775 780 

Leu Asn Leu Pro Gin Pro Glu Leu Ala Glu Glu Ala Ala Gin Arg His 
785 790 795 800 

Thr Gin Leu Thr Gly Pro Ala Leu Glu Ala Trp Thr Asp Arg Glu Leu 
805 810 815 

Ser Ser Phe Trp Ser Val Ala Gly Leu Cys Ala lie Gly Pro Ser Pro 
820 825 830 

Cys Pro Ser Leu Glu Pro Cys. Arg Leu Thr Ser Trp Ser Ser Leu Ser 
835 840 845 

Pro Asp Pro Lys Ser Glu Leu Gly Pro Val Lys Ala Asp Gly Arg Leu 
850 855 860 



Arg 




865 




<210> 


5 


<211> 


2463 


<212> 


DNA 


<213> 


Rat coronavirus 


<220> 




<221> 


CDS 


<222> 


(1).. (2463) 


<223> 


rat XT-I 


<400> 


5 
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tac ttc tct cac egg ccc aaa gag aaa gtt egg acc gac age aac aat 48 
Tyr Phe Ser His Arg Pro Lys Glu Lys Val Arg Thr Asp Ser Asn Asn 
15 10 15 

gag aac tea gtc ccc aag gac ttt gag aat gtg gac aac age aac ttc 96 
Glu Asn Ser Val Pro Lys Asp Phe Glu Asn Val Asp Asn Ser Asn Phe 
20 25 30 

gca ccc agg act caa aag cag aag cac cag cca gag ttg gca aag aag 144 
Ala Pro Arg Thr Gin Lys Gin Lys His Gin Pro Glu Leu Ala Lys Lys 
35 40 45 

ccc etc age agg cag aaa gag cgt ttg cag aga aag ctg ggc gec cag 192 
Pro Leu Ser Arg Gin Lys Glu Arg Leu Gin Arg Lys Leu Gly Ala Gin 
50 55 60 

gac aaa gga cag ggg cag tea gtc eta gga aaa ggc ccc aag gag gtg 240 
Asp Lys Gly Gin Gly Gin Ser Val Beu Gly Lys -Gly Pro Lys Glu Val 
65 " 70 75 80 

ctg cct cct egg gaa aaa gee cca ggc aac agt age caa ggg aag gat 288 
Leu Pro Pro Arg Glu Lys Ala Pro Gly Asn Ser Ser Gin Gly Lys Asp 
85 90 95 

etc tea aga cac age cat age agg aag agt ggt ggc ggt ggg tec ccg 336 
Leu Ser Arg His Ser His Ser Arg Lys Ser Gly Gly Gly Gly Ser Pro 
100 105 110 

gaa act aag tct gac cag gtc ccc aag tgt gac ate tct ggc aag gag 384 
Glu Thr Lys Ser Asp Gin Val Pro Lys Cys Asp lie Ser Gly Lys Glu 
115 120 125 

gee ate tea gcg ctg acc cgc get aag tec aag cac tgt cgc cag gag 432 
Ala lie Ser Ala Leu Thr Arg Ala Lys Ser Lys His Cys Arg Gin Glu 
130 135 140 

att gca gaa. acc tac tgt cgc cac aag ctg ggg ctg ctg atg cca gag 480 
lie Ala Glu Thr Tyr Cys Arg His Lys Leu Gly Leu Leu Met Pro Glu 
145 150 ~ 155 160 

aag gtg get cga ttc tgt ccc ctg gaa ggc aaa gee aac aag aat gtc 528 
Lys Val Ala Arg Phe Cys Pro Leu Glu Gly Lys Ala Asn Lys Asn Val 
165 170 " 175 

cag tgg gat gag gat get gtt gag tac atg ccc ccc aac cca gtc aga 576 
Gin Trp Asp Glu Asp Ala Val Glu Tyr Met Pro Pro Asn Pro Val Arg 
180 185 190 

ate gee ttt gtc ctg gtg gtc cat ggc cgt gee tct cga cag ctg cag 624 
lie Ala Phe Val Leu Val Val His Gly Arg Ala Ser Arg Gin Leu Gin 
195 200 205 

cgc atg ttt aag gec ate tac cac aaa gac cat ttc tac tat ate cat 672 
Arg Met Phe Lys Ala He Tyr His Lys Asp His Phe Tyr Tyr He His 
210 215 220 

gtg gat aag cgt tec aat tac ctg cat egg caa gtg etc cag ttc tec 720 
Val Asp Lys Arg Ser Asn Tyr Leu His Arg Gin Val Leu Gin Phe Ser 
225 230 235 240 



agg cag tac gac aac gtc cga gtc act tec tgg agg atg gee acc att 
Arg Gin Tyr Asp Asn Val Arg Val Thr Ser Trp Arg Met Ala Thr He 
245 250 " 255 



768 
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tgg ggc gga gcc age etc ctg tec act tac ctg cag age atg egg gat 816 
Trp Gly Gly Ala Ser Leu Leu Ser Thr Tyr Leu Gin Ser Met Arg Asp 
260 265 270 

eta ctg gag atg act gac tgg ccc tgg gac ttc ttc ate aac ctg agt 864 
Leu Leu Glu Met Thr Asp Trp Pro Trp Asp Phe Phe lie Asn Leu Ser 
275 280 285 

gcg get gac tac ccc ate agg aca aat gac cag etc gta gca ttt ctt 912 
Ala Ala Asp Tyr Pro lie Arg Thr Asn Asp Gin Leu Val Ala Phe Leu 
290 295 300 

tec aga tac cga gat atg aat ttc ctg aag tea cat ggt egg gac aat 960 
Ser Arg Tyr Arg Asp Met Asn Phe Leu Lys Ser His Gly Arg Asp Asn 
305 ~ 310 315 320 

gca agg ttc ate egg aag cag gac ctg gac cgt etc ttc ctg gag tgt 1008 
Ala Arg Phe He Arg Lys Gin Asp Lieu Asp Arg -Leu Phe Leu Glu Cys 
325 330 335 

gat aca cac atg tgg cgc ctg ggg gac egg cga ate cca gag ggc att 1056 
Asp Thr His Met Trp Arg Leu Gly Asp Arg Arg He Pro Glu Gly He 
340 345 350 

get gta gat ggc ggc tct gat tgg ttc ctg eta aac egg aag ttt gta 1104 
Ala Val Asp Gly Gly Ser Asp Trp Phe Leu Leu Asn Arg Lys Phe Val 
355 360 365 

gag tat gtg gca ttc tec aca gat gac ctg gtg ace aag atg aag cag 1152 
Glu Tyr Val Ala Phe Ser Thr Asp Asp Leu Val Thr Lys Met Lys Gin 
370 375 380 

ttc tac tct tat ace etc etc cct get gag tec ttt ttc cac acg gtc 1200 
Phe Tyr Ser Tyr Thr Leu Leu Pro Ala Glu Ser Phe Phe His Thr Val 
385 390 395 400 

ctg gag aac age ccc cac tgt gac ace atg gtg gat aac aac ctg cgc 124 8 
Leu Glu Asn Ser Pro His Cys Asp Thr Met Val Asp Asn Asn Leu Arg 
405 410 415 

ate acc aat tgg aac cgc aag ctg ggc tgc aag tgc cag tac aag cat 1296 
He Thr Asn Trp Asn Arg Lys Leu Gly Cys Lys Cys Gin Tyr Lys His 
420 425 430 

att gtg gac tgg tgt ggc tgc tct ccc aat gac ttc aag cct cag gac 134 4 
He Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys Pro Gin Asp 
435 440 * 445 

ttc cat cgc ttc cag cag acg gcc egg ccc acc ttc ttt gcc cga aag 1392 
Phe His Arg Phe Gin Gin Thr Ala Arg Pro Thr Phe Phe Ala Arg Lys 
450 455 460 

ttc gaa gcc ata gtg aac cag gaa ate att ggg cag ttg gac tct tac 1440 
Phe Glu Ala He Val Asn Gin Glu He He Gly Gin Leu Asp Ser Tyr 
465 470 475 480 

ctg tat ggc aac tat cct gcg ggc acc ccg ggc etc cgc tct tac tgg 1488 
Leu Tyr Gly Asn Tyr Pro Ala Gly Thr Pro Gly Leu Arg Ser Tyr Trp 
485 490 495 



gag aat gtc tat gac gaa cca gat ggc ate 
Glu Asn Val Tyr Asp Glu Pro Asp Gly He 
500 505 



cag age etc age gat gtg 
Gin Ser Leu Ser Asp Val 
510 



1536 
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gca etc acc atg tat cat tec ttc ate cgc ctg ggt ctt cga agg get 1584 
Ala Leu Thr Met Tyr His Ser Phe lie Arg Leu Gly Leu Arg Arg Ala 
515 520 " 525 

gag tea teg eta cac acg gat ggg gag aac age tgc agg tac tat cca 1632 
Glu Ser Ser Leu His Thr Asp Gly Glu Asn Ser Cys Arg Tyr Tyr Pro 
530 535 540 

atg ggc cac cca gcg tct gtc cac etc tac ttc ctg get gac cga ttc 1680 
Met Gly His Pro Ala Ser Val His Leu Tyr Phe Leu Ala Asp Arg Phe 
545 550 555 560 

cag ggc ttt ctg ate aag cac cat gtg acc aac ctt get gtg age aaa 1728 
Gin Gly Phe Leu He Lys His His Val Thr Asn Leu Ala Val Ser Lys 
565 570 575 

ctg gag aca ctg gag aca tgg atg atg cca aag aaa gtc ttc aag gtc 1776 
Leu Glu Thr Leu Glu Thr Trp Met Met Pro Lys -Lys Val Phe Lys Val 
580 585 " " 590 

gca agt ccc ccc agt gac ttt ggg agg ctt cag ttt tct gag gtt ggc 1824 
Ala Ser Pro Pro Ser Asp Phe Gly Arg Leu Gin Phe Ser Glu Val Gly 
595 600 605 



act gac tgg gat gee aag gag aga eta ttc egg aac ttt ggt ggt ctt 1872 
Thr Asp Trp Asp Ala Lys Glu Arg Leu Phe Arg Asn Phe Gly Gly Leu 
610 615 620 

ttg ggg ccc atg gat gag ccg gtg gga atg cag aaa tgg ggg aag gga 1920 
Leu Gly Pro Met Asp Glu Pro Val Gly Met Gin Lys Trp Gly Lys Gly 
625 630 ^ 635 ^ 640 

ccc aat gtg acc gtg act gtt att tgg gtg gat cct gtc aat gtc ate 1968 
Pro Asn Val Thr Val Thr Val He Trp Val Asp Pro Val Asn Val He 
645 650 655 

gca gee acc tat gat ate ctg ate gag tec act gcg gaa ttc aca cac 2016 
Ala Ala Thr Tyr Asp He Leu He Glu Ser Thr Ala Glu Phe Thr His 
660 665 670 

tac aaa ccc cct ttg aat ctg cct ctg agg cct ggt gtc tgg aca gtg 2064 
Tyr Lys Pro Pro Leu Asn Leu Pro Leu Arg Pro Gly Val Trp Thr Val 
675 680 685 

aag att etc cat cac tgg gta cca gtt gca gag acc aaa ttc ctt gtc 2112 
Lys He Leu His His Trp Val Pro Val Ala Glu Thr Lys Phe Leu Val 
690 695 700 

gca cct ctg acc ttc tec aac aag cag ccc ate aaa cca gag gag get 2160 
Ala Pro Leu Thr Phe Ser Asn Lys Gin Pro He Lys Pro Glu Glu Ala 
705 710 715 720 

ttg aag ctg cac aat ggg cca cct cgc agt gee tac atg gag cag agt 2208 
Leu Lys Leu His Asn Gly Pro Pro Arg Ser Ala Tyr Met Glu Gin Ser 
725 730 735 

ttc cag age eta aac cca gtc etc age ctg cac ate aat cct gee caa 2256 
Phe Gin Ser Leu Asn Pro Val Leu Ser Leu His He Asn Pro Ala Gin 
740 745 750 



gtg gag cag gee egg aag aac gca gee ttc act ggg aca gca eta gaa 
Val Glu Gin Ala Arg Lys Asn Ala Ala Phe Thr Gly Thr Ala Leu Glu 
755 760 765 



2304 
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gcc tgg ctg gtg ggt ggg act tgg act gcc atg gac gtc tgt gcc aca 2352 
Ala Trp Leu Val Gly Gly Thr Trp Thr Ma Met Asp Val Cys Ala Thr 
770 775 780 

ggc ccc act gcc tgc cca gtc atg caa acc tgc age caa aca gcc tgg 2400 
Gly Pro Thr Ala Cys Pro Val Met Gin Thr Cys Ser Gin Thr Ala Trp 
785 790 795 800 

agt tec ttc age cct gat ccc aag tea gag ctg ggt gca gtg aag cct 2448 
Ser Ser Phe Ser Pro Asp Pro Lys Ser Glu Leu Gly Ala Val Lys Pro 
805 810 815 

gat ggt egg etc agg 24 63 

Asp Gly Arg Leu Arg 
820 



<210> 6 
<211> 821 
<212> PRT 

<213> Rat coronavirus 
<400> 6 

Tyr Phe Ser His Arg Pro Lys Glu Lys Val Arg Thr Asp Ser Asn Asn 
15 10 15 

Glu Asn Ser Val Pro Lys Asp Phe Glu Asn Val Asp Asn Ser Asn Phe 
20 25 30 

Ala Pro Arg Thr Gin Lys Gin Lys His Gin Pro Glu Leu Ala Lys Lys 
35 40 45 

Pro Leu Ser Arg Gin Lys Glu Arg Leu Gin Arg Lys Leu Gly Ala Gin 
50 55 " 60 

Asp Lys Gly Gin Gly Gin Ser Val Leu Gly Lys Gly Pro Lys Glu Val 
65 70 ~ 75 " " 80 

Leu Pro Pro Arg Glu Lys Ala Pro Gly Asn Ser Ser Gin Gly Lys Asp 
85 90 95 

Leu Ser Arg His Ser His Ser Arg Lys Ser Gly Gly Gly Gly Ser Pro 
100 105 ~ 110 

Glu Thr Lys Ser Asp Gin Val Pro Lys Cys Asp He Ser Gly Lys Glu 
115 120 125 

Ala He Ser Ala Leu Thr Arg Ala Lys Ser Lys His Cys Arg Gin Glu 
130 135 " 140 

He Ala Glu Thr Tyr Cys Arg His Lys Leu Gly Leu Leu Met Pro Glu 
145 150 155 160 

Lys Val Ala Arg Phe Cys Pro Leu Glu Gly Lys Ala Asn Lys Asn Val 
165 170 175 

Gin Trp Asp Glu Asp Ala Val Glu Tyr Met Pro Pro Asn Pro Val Arg 
180 185 190 

He Ala Phe Val Leu Val Val His Gly Arg Ala Ser Arg Gin Leu Gin 
195 200 205 
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Arg Met Phe Lys Ala He Tyr His Lys Asp His Phe Tyr Tyr He His 
210 215 220 

Val Asp Lys Arg Ser Asn Tyr Leu His Arg Gin Val Leu Gin Phe Ser 
225 230 235 240 

Arg Gin Tyr Asp Asn Val Arg Val Thr Ser Trp Arg Met Ala Thr He 
245 250 255 

Trp Gly Gly Ala Ser Leu Leu Ser Thr Tyr Leu Gin Ser Met Arg Asp 
260 265 270 

Leu Leu Glu Met Thr Asp Trp Pro Trp Asp Phe Phe He Asn Leu Ser 
275 280 285 

Ala Ala Asp Tyr Pro He Arg Thr Asn Asp Gin Leu Val Ala Phe Leu 
290 295 300 

Ser Arg Tyr Arg Asp Met Asn Phe Leu Lys Ser His Gly Arg Asp Asn 
305 310 315 320 

Ala Arg Phe He Arg Lys Gin Asp Leu Asp Arg Leu Phe Leu Glu Cys 
325 330 335 

Asp Thr His Met Trp Arg Leu Gly Asp Arg Arg He Pro Glu Gly He 
340 345 350 

Ala Val Asp Gly Gly Ser Asp Trp Phe Leu Leu Asn Arg Lys Phe Val 
355 360 365 

Glu Tyr Val Ala Phe Ser Thr Asp Asp Leu Val Thr Lys Met Lys Gin 
370 • 375 380 

Phe Tyr Ser Tyr Thr Leu Leu Pro Ala Glu Ser Phe Phe His Thr Val 
385 390 395 400 

Leu Glu Asn Ser Pro His Cys Asp Thr Met Val Asp Asn Asn Leu Arg 
405 410 415 

He Thr Asn Trp Asn Arg Lys Leu Gly Cys Lys Cys Gin Tyr Lys His 
420 425 430 

He Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys Pro Gin Asp 
435 440 445 



Phe His Arg Phe Gin Gin Thr Ala 
450 455 

Phe Glu Ala He Val Asn Gin Glu 
465 470 

Leu Tyr Gly Asn Tyr Pro Ala Gly 
485 

Glu Asn Val Tyr Asp Glu Pro Asp 
500 

Ala Leu Thr Met Tyr His Ser Phe 
515 520 



Arg Pro Thr Phe Phe Ala Arg Lys 
460 

He He Gly Gin Leu Asp Ser Tyr 
475 480 

Thr Pro Gly Leu Arg Ser Tyr Trp 
490 495 

Gly He Gin Ser Leu Ser Asp Val 
505 510 

He Arg Leu Gly Leu Arg Arg Ala 
525 



Glu Ser Ser Leu His Thr Asp Gly Glu Asn Ser Cys Arg Tyr Tyr Pro 
530 535 540 
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Met Gly His Pro Ala Ser Val His Leu Tyr Phe Leu Ala Asp Arg Phe 
545 550 555 ' 560 

Gin Gly Phe Leu lie Lys His His Val Thr Asn Leu Ala Val Ser Lys 
565 ' 570 575 

Leu Glu Thr Leu Glu Thr Trp Met Met Pro Lys Lys Val Phe Lys Val 
580 585 590 

Ala Ser Pro Pro Ser Asp Phe Gly Arg Leu Gin Phe Ser Glu Val Gly 
595 600 605 

Thr Asp Trp Asp Ala Lys Glu Arg Leu Phe Arg Asn Phe Gly Gly Leu 
610 615 620 

Leu Gly Pro Met Asp Glu Pro Val Gly Met Gin Lys Trp Gly Lys Gly 
625 630 635 640 

Pro Asn Val Thr Val Thr Val lie Trp Val Asp Pro Val Asn Val He 
645 650 655 

Ala Ala Thr Tyr Asp He Leu He Glu Ser Thr Ala Glu Phe Thr His 
660 665 670 

Tyr Lys Pro Pro Leu Asn Leu Pro Leu Arg Pro Gly Val Trp Thr Val 
675 680 685 

Lys He Leu His His Trp Val Pro Val Ala Glu Thr Lys Phe Leu Val 
690 695 700 

Ala Pro Leu Thr Phe Ser Asn Lys Gin Pro He Lys Pro Glu Glu Ala 
705 710 715 720 

Leu Lys Leu His Asn Gly Pro Pro Arg Ser Ala Tyr Met -Glu Gin Ser 
725 730 735 

Phe Gin Ser Leu Asn Pro Val Leu Ser Leu His He Asn Pro Ala Gin 
740 745 750 

Val Glu Gin Ala Arg Lys Asn Ala Ala Phe Thr Gly Thr Ala Leu Glu 
755 760 765 

Ala Trp Leu Val Gly Gly Thr Trp Thr Ala Met Asp Val Cys Ala Thr 
770 775 780 

Gly Pro Thr Ala Cys Pro Val Met Gin Thr Cys Ser Gin Thr Ala Trp 
785 790 795 800 

Ser Ser Phe Ser Pro Asp Pro Lys Ser Glu Leu Gly Ala Val Lys Pro 
805 810 815 



Asp Gly Arg Leu Arg 
820 



<210> 7 
<211> 2858 
<212> DNA 

<213> Rat coronavirus 

<220> 

<221> CDS 

<222> (56) . . (2650) 
<223> rat XT-2 
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<400> 7 

gcgctcgcgg ccgggttgca gggccggacg cgcgccccgc gtcccgggca ggaag atg 58 

Met 
1 

gtg gcg age gcg egg gtg cag aag ctg gtg egg cgc tac aag ctg gcg 106 
Val Ala Ser Ala Arg Val Gin Lys Leu Val Arg Arg Tyr Lys Leu Ala 
5 10 15 

ate gee ace gcg ctg gec ate ctg ctg ctg cag ggc ctg gtg gtg tgg 154 
He Ala Thr Ala Leu Ala He Leu Leu Leu Gin Gly Leu Val Val Trp 
20 25 30 

age ttc age ggc ctg gag gag gac gag ccg ggc gag aaa gga agg cag 202 
Ser Phe Ser Gly Leu Glu Glu Asp Glu Pro Gly Glu Lys Gly Arg Gin 
35 40 45 

aga aag cca egg ccg eta gac cct gga gag ggt tec aag gac acg gac 
Arg Lys Pro Arg Pro Leu Asp Pro Gly Glu Gly -Ser Lys Asp Thr Asp 
50 ' 55 60 65 

age tct gca ggg cgt agg ggc agt get ggg aga agg cac gga egg tgg 298 
Ser Ser Ala Gly Arg Arg Gly Ser Ala Gly Arg Arg His Gly Arg Trp 
70 75 80 

egg ggc cgc gec gag age cca ggt gtg cct gtg gec aag gtg gtg egg 34 6 
Arg Gly Arg Ala Glu Ser Pro Gly Val Pro Val Ala Lys Val Val Arg 
85 90 95 

gca gta acc agt egg cag aga gec age cga egg gtg ccc cct gee cca 394 
Ala Val Thr Ser Arg Gin Arg Ala Ser Arg Arg Val Pro Pro Ala Pro 
100 105 110 



250 



cct cca gag gca cca ggc cgc cag aac ctg agt gga gca gca gec ggg 
Pro Pro Glu Ala Pro Gly Arg Gin Asn Leu Ser Gly Ala Ala Ala Gly 
115 . 120 125 



442 



gaa gca ctg ata ggt gee gec ggc ttc cca caa cat gga gac aca ggg 4 90 
Glu Ala Leu He Gly Ala Ala Gly Phe Pro Gin His Gly Asp Thr Gly 
130 135 140 145 

agt gtg gag ggt gec cct cag ccc acg gat aac age ttc act ccg aag 538 
Ser Val Glu Gly Ala Pro Gin Pro Thr Asp Asn Ser Phe Thr Pro Lys 
150 , 155 160 

tgt gag att gtg ggc aag gat gca ctg tea gca ctg gee egg gec age 
Cys Glu He Val Gly Lys Asp Ala Leu Ser Ala Leu Ala Arg Ala Ser 
165 170 175 

acc aag cat tgt caa caa gag att get aat gta gtg tgc ctg cac caa 634 
Thr Lys His Cys Gin Gin Glu He Ala Asn Val Val Cys Leu His Gin 
180 * 185 190 



586 



get ggg aac eta atg ccc aag tct gtg ccc egg cac tgc cag ctg get 
Ala Gly Asn Leu Met Pro Lys Ser Val Pro Arg His Cys Gin Leu Ala 
195 200 205 



682 



ggc aag gtg age cct ggc ate cag tgg gaa gag gtc egg gec cag cag 730 
Gly Lys Val Ser Pro Gly He Gin Trp Glu Glu Val Arg Ala Gin Gin 
210 215 220 225 



cct gtg agt ggc cct ctg gta cgc ate gec tac atg ctg gtg gtt cac 
Pro Val Ser Gly Pro Leu Val Arg He Ala Tyr Met Leu Val Val His 
230 235 240 



778 



WO 01/49831 PCT/EP00/13311 

22 / 32 

ggc cgt get gtg cgc cag ctg aag cgt ctt ctg aag gec gtc tac cac 826 
Gly Arg Ala Val Arg Gin Leu Lys Arg Leu Leu Lys Ala Val Tyr His 
245 250 4 255 

gag gag cac ttc ttt tat att cat gtg gac aag cgt tec aac tac ctg 87 4 
Glu Glu His Phe Phe Tyr lie His Val Asp Lys Arg Ser Asn Tyr Leu 
260 265 270 

tac egg gag gtg gta gag ctg gee cag cac tac gac aat gta egg gta 922 
Tyr Arg Glu Val Val Glu Leu Ala Gin His Tyr Asp Asn Val Arg Val 
275 280 285 

aca cct tgg cgc atg gtc ace ate tgg ggt ggg get age ctt ctg agg 970 
Thr Pro Trp Arg Met Val Thr lie Trp Gly Gly Ala Ser Leu Leu Arg 
290 295 300 305 

atg tac ctg egg age atg aag gac ctt ctg gag act cct ggc tgg ace 1018 
Met Tyr Leu Arg Ser Met Lys Asp teu Leu Glu -Thr Pro Gly Trp Thr 
310 315 320 

tgg gac ttc ttc ate aac ctg age get act gac tat cca ace agg acg 1066 
Trp Asp Phe Phe lie Asn Leu Ser Ala Thr Asp Tyr Pro Thr Arg Thr 
325 330 335 

aat gag gag ctg gta gcg ttc tta tec aag aac egg gac aag aat ttc 1114 
Asn Glu Glu Leu Val Ala Phe Leu Ser Lys Asn Arg Asp Lys Asn Phe 
340 345 350 

etc aag tea cac ggg cga gac aat tec agg ttc ate aag aaa caa ggc 1162 
Leu Lys Ser His Gly Arg Asp Asn Ser Arg Phe lie Lys Lys Gin Gly 
355 * 360 365 

ctg gac egg ctt ttc cat gag tgt gat tec cac atg tgg cgc ctg ggt 1210 
Leu Asp Arg Leu Phe His Glu Cys Asp Ser His Met Trp Arg Leu Gly 
370 * 375 380 385 

gaa egg cag ate ccg gca ggc ate gtg gtg gat ggt ggc tct gac tgg 1258 
Glu Arg Gin lie Pro Ala Gly He Val Val Asp Gly Gly Ser Asp Trp 
390 395 400 

ttc gtg ctg aca cgc age ttt gtg gaa tat gtg gtg tat aca gag gat 1306 
Phe Val Leu Thr Arg Ser Phe Val Glu Tyr .Val Val Tyr Thr Glu Asp 
405 410 415 

cct ctg gtg gee cag ctt cgc cag ttc tat aca tac aca ttg ctt cca 1354 
Pro Leu Val Ala Gin Leu Arg Gin Phe Tyr Thr Tyr Thr Leu Leu Pro 
420 425 . 430 

gee gag tec ttc ttc cac aca gtg ctg gag aac age cca gee tgt gag 1402 
Ala Glu Ser Phe Phe His Thr Val Leu Glu Asn Ser Pro Ala Cys Glu 
435 440 445 

age eta gtg gac aac aac ctg egg gtt acc aac tgg aac egg aag ctg 1450 
Ser Leu Val Asp Asn Asn Leu Arg Val Thr Asn Trp Asn Arg Lys Leu 
450 455 460 465 

ggc tgc aag tgc cag tac aag cac ate gtg gac tgg tgt ggc tgc tec 14 98 
Gly Cys Lys Cys Gin Tyr Lys His He Val Asp Trp Cys Gly Cys Ser 
470 ~ ' 475 480 



ccc aac gac ttc aag cca cag gac ttc ctg egg ctt cag caa gtc tec 
Pro Asn Asp Phe Lys Pro Gin Asp Phe Leu Arg Leu Gin Gin Val Ser 
485 " 490 495 
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aga ccc acc ttc ttt gcc egg aag ttt gag teg act gtg aac cag gaa 1594 
Arg Pro Thr Phe Phe Ala Arg Lys Phe Glu Ser Thr Val Asn Gin Glu 
500 505 510 

gtc ctg gaa att ttg gac ttc cac ctg tat ggc age tac cca ccc ggc 1642 
Val Leu Glu lie Leu Asp Phe His Leu Tyr Gly Ser Tyr Pro Pro Gly 
515 * 520 525 

acc cca gcc etc aag gcc tac tgg gag aac ate tac gac atg gcc gat 1690 
Thr Pro Ala Leu Lys Ala Tyr Trp Glu Asn lie Tyr Asp Met Ala Asp 
530 535 540 545 

ggc cct agt gga etc age gat gtc eta etc act get tac aca gcc ttt 1738 
Gly Pro Ser Gly Leu Ser Asp Val Leu Leu Thr Ala Tyr Thr Ala Phe ' 
550 555 560 

gcc cgt ate agt ctg cgt cat get gcc act gtt tec cca ctg gcc act 1786 
Ala Arg lie Ser Leu Arg His Ala A\La Thr Val -Ser Pro Leu Ala Thr 
565 570 575 

gca gtc tgc agg ttt gag ccc agg ggg ttg ccg tec age gtg cac ctg 1834 
Ala Val Cys Arg Phe Glu Pro Arg Gly Leu Pro Ser Ser Val His Leu 
580 585 590 

tat ttc tat gac gac cat ttc cag ggc tac ctg gtg acg cag gca gtg 1882 
Tyr Phe Tyr Asp Asp His Phe Gin Gly Tyr Leu Val Thr Gin Ala Val 
595 600 605 

cag ccc tea gcc cag ggg cca gca gag aca ctt gag atg tgg ctg atg 1930 
Gin Pro Ser Ala Gin Gly Pro Ala Glu Thr Leu Glu Met Trp Leu Met 
610 615 620 625 

ccc cag agg ttg ctg aag ccg ttg ggg cac agt gac cag gcc age egg 197 8 
Pro Gin Arg Leu Leu Lys Pro Leu Gly His Ser Asp Gin Ala Ser Arg 
630 635 640 

etc cag agt ctg gag gtt ggc act gag tgg gac ccc aaa gaa cgt etc 2026 
Leu Gin Ser Leu Glu Val Gly Thr Glu Trp Asp Pro Lys Glu Arg Leu 
645 650 655 

ttc egg aac ttt ggg ggc ctg ttg gga cca ctg gat gaa cct gtg gcc 2074 
Phe Arg Asn Phe Gly Gly Leu Leu Gly Pro Leu Asp Glu Pro Val Ala 
660 665 670 

atg cag cgc tgg gcc egg ggc ccc aac etc aca gcc act gtg gtc tgg 2122 
Met Gin Arg Trp Ala Arg Gly Pro Asn Leu Thr Ala Thr Val Val Trp 
675 680 685 

att gac ccc acc tat gtt gtg gcc aca tec tat gac ate acg gta gat 2170 
lie Asp Pro Thr Tyr Val Val Ala Thr Ser Tyr Asp He Thr Val Asp 
690 695 700 705 

gcg gac act gaa gtc acg cag tac aag ccc cca ctg age ctg cca ctg 2218 
Ala Asp Thr Glu Val Thr Gin Tyr Lys Pro Pro Leu Ser Leu Pro Leu 
710 715 720 

egg cca gga gcc tgg act gtt cga ttg ctt cag ttc tgg gag ccc ctg 2266 
Arg Pro Gly Ala Trp Thr Val Arg Leu Leu Gin Phe Trp Glu Pro Leu 
725 730 735 

ggt gag acc cgc ttc etc gtg ctg cca ttg acc ttc aac cac aaa eta 2314 
Gly Glu Thr Arg Phe Leu Val Leu Pro Leu Thr Phe Asn His Lys Leu 
740 745 750 
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cct etc agg aaa gat gat gec age tgg ctg cat gcg gga cca ccc cac 2362 
Pro Leu Arg Lys Asp Asp Ala Ser Trp Leu His Ala Gly Pro Pro His 
755 760 765 

aac gaa tac atg gaa cag agt ttc cag gga eta agt ggc ate ctg aat 2410 
Asn Glu Tyr Met Glu Gin Ser Phe Gin Gly Leu Ser Gly lie Leu Asn 
770 775 780 785 

ctg cct cag cca gag gee gtg gag gag get gee egg egg cac aca gag 24 58 
Leu Pro Gin Pro Glu Ala Val Glu Glu Ala Ala Arg Arg His Thr Glu 
790 795 800 

etc aca ggt cct gca ctt gag gec tgg aca gat ggg gaa ctg age agt 2506 
Leu Thr Gly Pro Ala Leu Glu Ala Trp Thr Asp Gly Glu Leu Ser Ser 
805 810 815 

ttc tgg tct gtt gca gga ttg tgt gee ata ggg cct tct tct tgt ccc 2554 
Phe Trp Ser Val Ala Gly Leu Cys ftla lie Gly -Pro Ser Ser Cys Pro 
820 825 830 

tec ctg gag etc tgc aga ctg acc age tgg age tct ctg tct cct gac 2602 
Ser Leu Glu Leu Cys Arg Leu Thr Ser Trp Ser Ser Leu Ser Pro Asp 
835 840 845 

ccc aag tea gag ctg ggg cct gtc aaa get gac gga cga etc agg tag 2650 
Pro Lys Ser Glu Leu Gly Pro Val Lys Ala Asp Gly Arg Leu Arg 
850 855 860 865 

caggacccca gccagcacaa cceggaggag ceggggaatt gcaccttaca gacaatggag 2710 

ggaegtctet cctctggtag ccagggagcc ctagaacagt ggtggggctg tctgtctgtc 2770 

aggctttcct aggaaatgga cgacatcagg ccctgctgag atccatgaag tttccccact 2830 

ggaggaggga gccccactgg cagcacag 2858 



<210> 8 
<211> 864 
<212> PRT 

<213> Rat coronavirus 
<400> 8 

Met Val Ala Ser Ala Arg Val Gin Lys Leu Val Arg Arg Tyr Lys Leu 
15 10 15 

Ala He Ala Thr Ala Leu Ala He Leu Leu Leu Gin Gly Leu Val Val 
20 25 30 

Trp Ser Phe Ser Gly Leu Glu Glu Asp Glu Pro Gly Glu Lys Gly Arg 
35 40 45 

Gin Arg Lys Pro Arg Pro Leu Asp Pro Gly Glu Gly Ser Lys Asp Thr 
50 55 60 

Asp Ser Ser Ala Gly Arg Arg Gly Ser Ala Gly Arg Arg His Gly Arg 
65 70 75 80 

Trp Arg Gly Arg Ala Glu Ser Pro Gly Val Pro Val Ala Lys Val Val 
85 90 95 

Arg Ala Val Thr Ser Arg Gin Arg Ala Ser Arg Arg Val Pro Pro Ala 
100 105 110 

Pro Pro Pro Glu Ala Pro Gly Arg Gin Asn. Leu Ser Gly Ala Ala Ala 
115 120 125 
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Gly Glu Ala Leu lie Gly Ala Ala Gly Phe Pro Gin His Gly Asp Thr 
130 135 140 

Gly Ser Val Glu Gly Ala Pro Gin Pro Thr Asp Asn Ser Phe Thr Pro 
145 150 155 160 

Lys Cys Glu lie Val Gly Lys Asp Ala Leu Ser Ala Leu Ala Arg Ala 
165 170 175 

Ser Thr Lys His Cys Gin. Gin Glu He Ala Asn Val Val Cys Leu His 
180 185 190 

Gin Ala Gly Asn Leu Met Pro Lys Ser Val Pro Arg His Cys Gin Leu 
195 200 205 

Ala Gly Lys Val Ser Pro Gly He Gin Trp Glu Glu Val Arg Ala Gin 
210 215 220 

Gin Pro Val Ser Gly Pro Leu Val Arg He Ala Tyr Met Leu Val Val 
225 230 235 240 

His Gly Arg Ala Val Arg Gin Leu Lys Arg Leu Leu Lys Ala Val Tyr 
245 250 255 

His Glu Glu His Phe Phe Tyr He His Val Asp Lys Arg Ser Asn Tyr 
260 265 270 

Leu Tyr Arg Glu Val Val Glu Leu Ala Gin His Tyr Asp Asn Val Arg 
275 280 " 285 

Val Thr Pro Trp Arg Met Val Thr He Trp Gly Gly Ala Ser Leu Leu 
290 295 300 

Arg Met Tyr Leu Arg Ser Met Lys Asp Leu Leu Glu Thr Pro Gly Trp 
305 310 315 320 

Thr Trp Asp Phe Phe He Asn Leu Ser Ala Thr Asp Tyr Pro Thr Arg 
325 330 335 

Thr Asn Glu Glu Leu Val Ala Phe Leu Ser Lys Asn Arg Asp Lys Asn 
340 345 350 

Phe Leu Lys Ser His Gly Arg Asp Asn Ser Arg Phe He Lys Lys Gin 
355 360 365 

Gly Leu Asp Arg Leu Phe His Glu Cys Asp Ser His Met Trp Arg Leu 
370 " 375 380 

Gly Glu Arg Gin He Pro Ala Gly He Val Val Asp Gly Gly Ser Asp 
385 390 395 " 400 

Trp Phe Val Leu Thr Arg Ser Phe Val Glu Tyr Val Val Tyr Thr Glu 
405 410 415 

Asp Pro Leu Val Ala Gin Leu Arg Gin Phe Tyr Thr Tyr Thr Leu Leu 
420 425 * 430 

Pro Ala Glu Ser Phe Phe His Thr Val Leu Glu Asn Ser Pro Ala Cys 
435 440 445 



Glu Ser Leu Val Asp Asn Asn Leu Arg Val Thr Asn Trp Asn Arg Lys 
450 455 460 
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Leu Gly Cys Lys Cys Gin Tyr Lys His lie Val Asp Trp Cys Gly Cys 
465 470 475 480 

Ser Pro Asn Asp Phe Lys Pro Gin Asp Phe Leu Arg Leu Gin Gin Val 
485 490 495 

Ser Arg Pro Thr Phe Phe Ala Arg Lys Phe Glu Ser Thr Val Asn Gin 
500 505 510 

Glu Val Leu Glu lie Leu Asp Phe His Leu Tyr Gly Ser Tyr Pro Pro 
515 520 525 

Gly Thr Pro Ala Leu Lys Ala Tyr Trp Glu Asn lie Tyr Asp Met Ala 
530 535 540 

Asp Gly Pro Ser Gly Leu Ser Asp Val Leu Leu Thr Ala Tyr Thr Ala 
545 550 555 560 

Phe Ala Arg lie Ser Leu Arg His Ala Ala Thr Val Ser Pro Leu Ala 
565 570 575 

Thr Ala Val Cys Arg Phe Glu Pro Arg Gly Leu Pro Ser Ser Val His 
580 585 590 

Leu Tyr Phe Tyr Asp Asp His Phe Gin Gly Tyr Leu Val Thr Gin Ala 
595 600 605 

Val Gin Pro Ser Ala Gin Gly Pro Ala Glu Thr Leu Glu Met Trp Leu 
610 615 620 

Met Pro Gin Arg Leu Leu Lys Pro Leu Gly His Ser Asp Gin Ala Ser 
625 630 635 640 

Arg Leu Gin Ser Leu Glu Val Gly Thr Glu Trp Asp Pro Lys Glu Arg 
645 650 655 

Leu Phe Arg Asn Phe Gly Gly Leu Leu Gly Pro Leu Asp Glu Pro Val 
660 665 670 

Ala Met Gin Arg Trp Ala Arg Gly Pro Asn Leu Thr Ala Thr Val Val 
675 680 685 

Trp lie Asp Pro Thr Tyr Val Val Ala Thr Ser Tyr Asp lie Thr Val 
690 695 700 

Asp Ala Asp Thr Glu Val Thr Gin Tyr Lys Pro Pro Leu Ser Leu Pro 
705 710 715 720 

Leu Arg Pro Gly Ala Trp Thr Val Arg Leu Leu Gin Phe Trp Glu Pro 
725 730 735 

Leu Gly Glu Thr Arg Phe Leu Val Leu Pro Leu Thr Phe Asn His Lys 
740 745 750 

Leu Pro Leu Arg Lys Asp Asp Ala Ser Trp Leu His Ala Gly Pro Pro 
755 ~ ' 760 765 

His Asn Glu Tyr Met Glu Gin Ser Phe Gin Gly Leu Ser Gly lie Leu 
770 775 780 



Asn Leu Pro Gin Pro Glu Ala Val Glu Glu Ala Ala Arg Arg His Thr 
785 790 795 800 
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Glu Leu Thr Gly Pro Ala Leu Glu Ala Trp Thr Asp Gly Glu Leu Ser 
805 810 " 815 

Ser Phe Trp Ser Val Ala Gly Leu Cys Ala lie Gly Pro Ser Ser Cys 
820 825 830 

Pro Ser Leu Glu Leu Cys Arg Leu Thr Ser Trp Ser Ser Leu Ser Pro 
835 840 ' 845 

Asp Pro Lys Ser Glu Leu Gly Pro Val Lys Ala Asp Gly Arg Leu Arg 
850 855 860 



<210> 9 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
<400> 9 

Cys Ser Arg Gin Lys Glu Leu Leu Lys Arg Lys Leu Glu Gin Gin Glu 
1 5 10 15 

Lys 



<210> 10 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
<400> 10 

Gin Glu Glu Glu Gly Ser Gly Gly Gly Gin Lys 
15 10 

<210> 11 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
<400> 11 

Ser Glu Asn Glu Gly Ser Gly Met Ala Glu Gin Lys 
1 5 10 

<210> 12 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
<400> 12 

Thr Glu Asn Glu Gly Ser Gly Leu Thr Asn lie Lys 
15 10 
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<210> 13 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial. Sequence : primer 
<400> 13 

gtgggtatgc agaagtgggg gaagggac 

<210> 14 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 14 

ccctccgcaa tgcctaca 

<210> 15 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 15 

actgaggtca cgcaatacaa 

<210> 16 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 16 

gccgcactca ggttgatgaa gaagt 

<210> 17 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 17 

accaccagga caaaggcgat tctga 



<210> 18 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
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<400> 18 

agtcgaacag tccagggccc 

<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 19 

cacgatctcg cacttggggg 

<210> 20 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 20 

ctattcgatg atgaagatac cccaccaaac cc 



<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 21 

ggaagagctg ggtgtggaat 



<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 22 

tttcccgttg agatcctgct 



<210> 23 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 23 

acagacagca acaacgagaa 
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<210> 24 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 24 

aaaggaaggc agaggaagc 

<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 25 

acccctccac tgtctgtaag 

<210> 26 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; digestion 
peptide 

<400> 26 

Glu Leu Gly Ala Lys 
1 5 

<210> 27 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 27 
Glu Leu Leu Lys 
1 

<210> 28 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 



<400> 28 

Asp Met Asn Phe Leu Lys 
1 5 
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<210> 29 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 29 

lie Ala Ser Pro Pro Ser Asp Phe Gly Arg 
1 5 10 

<210> 30 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220>' 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 30 

Leu Leu Leu Asp 

1 

<210> 31 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 31 

Asp Phe Glu Asn Val Asp Asn Ser Asn Phe Ala Pro Arg 
1 5 10 

<210> 32 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 32 

Pro Thr Phe Phe Ala Arg 
1 5 

<210> 33 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 



<400> 33 

Leu Gin Phe Ser Glu Val Gly Thr Asp Xaa Asp Ala 
1 5 10 
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<210> 34 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 34 

Glu Leu Gly Ala Val Lys Pro Asp Gly Arg Leu 
15 10 

<210> 35 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 35 

Glu Leu Leu Lys Arg Lys Leu Glu Gin Gin Glu Lys 
1 5 10 

<210> 36 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 36 

Leu Gly Leu Leu Met Pro Glu Lys 
1 5 



